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(57) Abstract 

The present invention relates to 186 novel human secreted proteins and isolated nucleic acids containing the coding regions of the 
genes encoding such proteins. Also provided are vectors, host cells, antibodies, and recombinant methods for producing human secreted 
proteins. The invention further relates to diagnostic and therapeutic methods useful for diagnosing and treating disorders related to these 
novel human secreted proteins. 
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186 Human Secreted Proteins 
Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
5 their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 

10 organelle, contains different proteins essential for the function of the organelle. The cell 
uses "sorting signals," which are amino acid motifs located within the protein, to target 
proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 

15 (ER). The ER separates the membrane-bounded proteins from all other types of 

proteins. Once localized to the ER, both groups of proteins can be further directed to 
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to 
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other 
organelles. 

20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 

25 secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage of a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 

30 the conunercially valuable human insuhn, interferon. Factor VIII, human growth 
hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of the 
pervasive role of secreted proteins in human physiology, a need exists for identifying 
and characterizing novel human secreted proteins and the genes that encode them. This 
knowledge will allow one to detect, to treat, and to prevent medical disorders by using 

35 secreted proteins or the genes that encode them. 
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Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
5 and recombinant methods for producing the polypeptides and polynucleotides. Also 
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 

10 Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
15 environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered **by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 
20 In the present invention, a "secreted" protein refers to those proteins capable of 

being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protein can undergo extracellular processing to produce 
25 a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence 
30 of the fiill length cDN A sequence, including the 5* and 3' untranslated sequences, the 
coding region, with or without the signal sequence, the secreted protein coding region, 
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated 
amino acid sequence generated from the polynucleotide as broadly defined, 
35 In the present invention, the full length sequence identified as SEQ ID NO:X 

was often generated by overlapping sequences contained in multiple clones (contig 
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analysis). A representative clone containing all or most of the sequence for SEQ ID 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1 , each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 12301 Park Lawn Drive, Rockville 
5 Maryland 20852. USA. The ATCC deposit was made pursuant to the terms of the 
Budapest Treaty on the international recognition of the deposit of microorganisms for 
purposes of patent procedure. - 

^ A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained 
1 0 in SEQ ID NO:X, the complement thereof, or the cDNA contained within the clone 
deposited with the ATCC. "Stiingent hybridization conditions" refers to an overnight 
incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl. 
75 mM sodium citrate). 50 mM sodium phosphate (pH 7.6), 5x Denhardfs solution, 
10% dexu-an sulfate, and 20 ^g/ml denatured, sheared salmon sperm DNA. followed 
1 5 by washing the filters in 0. Ix SSC at about 65''C. 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished tiirough the manipulation of formamide concentration (lower percentages 
20 of formamide result in lowered stringency); salt conditions, or temperature. For 
example, lower stringency conditions include an overnight incubation at 37^C in a 
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH^PO,; 0.02M EDTA, 
pH 7.4), 0.5% SDS. 30% formamide, 1 00 ug/ml salmon sperm blocking DNA; 
followed by washes at 50°C with IXSSPE. 0.1% SDS. In addition, to achieve even 
25 lower stringency, washes performed following stringent hybridization can be done at 
higher salt concentrations (e.g. 5X SSC). 

Note that variations in tiie above conditions may be accompUshed tiirough the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
30 Denhardfs reagent. BLOTTO, heparin, denatured salmon sperm DNA. and 

commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
35 as any 3' terminal polyA+ ti-act of a cDNA shown in tiie sequence listing), or to a 
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complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide/' since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 
5 The polynucleotide of the present invention can be composed of any 

polyribonucleotide or polydeoxribonucleotide, which may be unmodified RN A or DN A 
or modified RNA or DNA. For example, polynucleotides can be composed of single- 
and double-stranded DNA, DNA that is a mixture of single- and double-stiranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and 
10 doublerstranded regions, hybrid molecules comprising DNA and RNA that may be 

single-stranded or, more typically, double-stranded or a mixture of single- and double- 
stranded regions. In addition, the polynucleotide can be composed of triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also 
contain one or more modified bases or DNA or RNA backbones modified for stability 
15 or for other reasons. "Modified" bases include, for example, tritylated bases and 

unusual bases such as inosine. A variety of modifications can be made to DNA and 
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically 
modified forms. 

The polypeptide of the present invention can be composed of amino acids joined 
20 to each other by peptide bonds or modified peptide bonds, i.e.. peptide isosteres, and 
may contain amino acids other than the 20 gene-encoded amino acids. The 
polypeptides may be modified by either natural processes, such as posttranslational 
processing, or by chemical modification techniques which are well known in the art. 
Such modifications are well described in basic texts and in more detailed monographs, 
25 as well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 
30 branched , for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
35 nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-linking, cyclization. disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine. 
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formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation. sulfation, transfer-RNA mediated addition of amino acids to proteins 
5 such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 

STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. 
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENTMODDFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990); 
10 Rattanetal., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO:Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
15 activity similar, but not necessarily identical to, an activity of a polypeptide of the 

present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it 
need not be idenUcal to that of the polypeptide, but rather substantially similar to the 
dose-dependence in a given activity as compared to the polypeptide of the present 
20 invention (i.e.. the candidate polypeptide wiU exhibit greater activity or not more than 
about 25-fold less and, preferably, not more than about tenfold less activity, and most 
preferably, not more than about three-fold less activity relative to the polypeptide of the 
present invention.) 

25 Polynucleotides and Polvnentides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in testes tumor and to a lesser extent in fetal 

brain. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly of the testes, and defects of tiie central nervous system 
such as seizure and neurodegenerative disorders. Similarly, polypeptides and antibodies 

35 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly cancer of die testes and central nervous system. 
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expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., testes and other reproductive tissue, brain and 
other tissue of the nervous system, and blood cells, and spleen, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

5 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of testicular cancer and 

10 treatment of central nervous system disorders since this gene is primarily expressed in 
the testes tumor and developing brain. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in cancer tissues, such as breast cancer and 

15 Wilm*s tumor, and to a lesser extent in fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, and/or tumors, particularly, those found in the breast, and developmental 

20 abnormalities or disorders. Similarly, jpolypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the glandular tissues, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., mammary 

25 tissue, and fetal tissue and, cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 

30 sequence shown in SEQ ID NO. 314 as residues: Pro- 1 1 to Thr-18, Leu-43 to Pro-50, 
Gly-64 to Leu-72, and Leu-81 to Lys-86. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of cancers and/or tumors, 
particularly, those found in the breast since expression is mainly in cancer/tumor 

35 tissues. May serve as therapeutic proteins for proliferation/differentiation of fetal 
tissues. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

This gene is expressed primarily in CD34 depleted buffy coat and to a lesser 
extent in spleen, chronic lymphocytic leukemia. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood disorders or 
leukemias, diseases of the immune system. Similarly, polypeptides and antibodies _ 
directed to these polypeptides are useful in providing inmiunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, and spleen, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

15 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders or 

20 leukemias, diseases of the immune system since expression is in tissues related to 
inmiune function. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

This gene is expressed primarily in CD34 depleted buffy coat. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood disorders or 
lymphocytic diseases. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

30 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 

35 having such a disorder, relative to the standard gene expression level, i.e., the 

expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders since 
expression is in tissues related to inrunune function. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

This gene is expressed primarily in CD34 depleted buffy coat. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood or immune 

10 diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., blood cells, and cancerous 

15 and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO. 317 as residues: 

20 Pro-13 to Lys-21. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders since 
expression is in tissues related to immune function. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This gene is expressed primarily in CD34 depleted buffy coat. 
Therefore, polynucleotides and polypeptides of the invention are useful as ' 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood or immune 
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., and blood cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
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The ,i«ue distribution indicates that polynueleotides and polypepUdes 
5 eotrespondtng to this gene ate useM for .«at.ent/diagnosis of bio<^Lses since it 
IS expressed m tissues .elated to inunune function. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

primarily in CD34 depleted buffy coat and to a lesser 
10 extent in pineal gland. ^luioaiesser 

^„ ,TTr ■^'"""""'"'^ polypeptides of the invendon are useful as 
^agenu for dtfTetenaal identiftcation of the tissue(s) or cell typefs) present in a 
bt^ogtcal sample and for diagnosis of diseases and co„diti„rl;Ls of fte i^^une 
^ste™ and b^n associated diseases. Similarly, polypeptides and antibodies dirrjio 
th^ polypepud^ ate useft.1 in providing inununological probes for differentiai 
denuficauon of the tissue(s) or cell type(s). For a number of disorders of the above 
rr °',r r — ^^^-e-. of t^s gene at 

Wes (e.g blood cells, and ptneal gland, and cancerous and wounded tissues) or 
bodtly flmds (e.g., sen,m, plasma, urine, synovial nuid or spinal Huid) or another 
tissue or celi sample taken (ron. an individual having such a disc^ier, relative to ti.e 
— gene exp^sston level, i.e., the expression level in healthy tissue or bodily 
fluid ftom an indtviduai not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
TOnding to this gene are useftil for ueatment/diagnosis of blood dLrders 
immune diseases or brain associated diseases (specifically of the pineal gland) since 
expression ism tissues related to immune function. 



15 



20 



30 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO- 8 

The translation product of tiiis gene shares sequence homology with an organic 
^tion trai^poter which is tiiough. to be important in organic cation ! take in Uie 
icdney and hver. (See Accession No. 2343059.) Preferred polypeptide fragments 
comprise the amino acid sequence r™iQM,CI.VNXELYFIFVRNXOVMVCSSU:D,GGlrn. 

VOTSEPSOT (SEQ ID NO: 615) or TMKDAenlorkakpkent (SEQ id no- 616) as well 
as N-ternuna a^d C-termina, deletions of these fragment. Also pLired ai^ 
polynucleotide fragments encoding these polypeptide fragments. 
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This gene is expressed primarily in liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the Ussue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatic and renal 
5 diseases where drug elimination/cation exchange (organic cation uptake) in the liver and 
kidney are problematic. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic or renal system, expression of this gene at significanUy higher 
1 0 or lower levels may be routinely detected in certain tissues and cell types (e.g., kidney 
and liver, arid cancerous and wounded tissues) or bodily fluids (e.g., serom, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
1 5 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 320 as residues: Asn-64 to Asn-74, and Gln-81 to Gly-87. 

The tissue distribution and homology to organic cation transporter indicate that 
polynucleotides and polypeptides corresponding to this gene are useful as a polyspecific 
transporter that is important for drug elimination in the liver (and possibly kidney) since 
20 expression is found in the liver. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

This gene is expressed primarily in eosinophil induced with IL-5 and to a lesser 
extent in fetal liver and spleen. This gene also maps to chromosome 15, and therefore 
25 can be used in linkage analysis as a marker for chromosome 15. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diseases of the immune 
system, particularly allergies or asthma. Similarly, polypeptides and antibodies directed 
30 to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the inunune system, expression of this gene at 
significantiy higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g.. blood ceUs, liver, and spleen, and cancerous and wounded tissues) or 
35 bodily fluids (e.g.. senim. plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
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Standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribuUon indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating/diagnosis of diseases involving 
5 esosinphil reactions since expression seems to be concentrated in eosinophils and other 
tissues involved in immunity such as the liver and spleen. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in tissues of hematopoietic lineage and to a 

10 lesser extent in Hodgkins lymphoma. Any frame shifts in this sequence can easily be 
clarified using known molecular biology techniques. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

1 5 not limited to^ and immune deficiency or dysfunction. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of tiiis gene 
at significantiy higher or lower levels may be routinely detected in certain tissues and 

20 cell types (e.g., hematopoietic cells, lymphoid and reticuloendothelial tissues, and 

cancerous tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

25 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment/ diagnosis for lymphomas or 
immune dysfuction or as a therapeutic protein usefiii in inmiune modulation based on 
expression in anergic T-cells and lymphomas. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

This gene is expressed primarily in neutrophils and to a lesser extent in activated 
lymphoid cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the cell type present in a biological sample and 
35 for diagnosis of diseases and conditions: inflamation. Similarly, polypeptides and 

antibodies directed to these polypeptides are usefiil in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
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of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., blood cells and lymphoid tissue, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
5 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO. 323 as residues: Glu-40 to Lys-46. 
The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for modulation of an immune reaction or as a 
growth factor for the differentiation or proliferation of neutrphils for the treatment of 
neutropenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

15 This gene is expressed primarily in brain and to a lesser extent in activated T- 

cells. It is likely that the open reading frame containing the predicted signal peptide 
continues in the 5* direction. Preferred polypeptide fragments comprise the amino acid 
sequence PRVRNSPEDLGLSLTGDSCKL (SEQ ID NO:6 17). 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neurodegenerative 
disorders including ischemic shock, alzheimers and cognitive disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 

25 number of disorders of the above tissues or cells, particularly of the central nervous 

system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., blood cells, and brain, and other tissue of 
the nervous system and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

30 an individual having such a disorder, relative to the standaird gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 324 as residues: Ser-5 to Glu-14, Ile-21 to Pro-35, Ser-65 to Asp-81, Cys-89 to 
Val-96, Lys-136 to Ser-145, Ile-152 to Met-169, and Arg-189 to Lys-196. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnostic/treatment for cancers of the given 
tissue or in the treatment of neurological disorders of the CNS. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 13 
^ This gene was also recently cloned by other groups, naming this calcium-activated 
potassium channel gene, hKCa4. (See Accession No. AF033021, see also. Accession 
5 No. 2584866.) This gene is mapped to human chromosome 19ql3.2. A second signal 
sequence likely exists upstream from the predicted signal sequence as described in 
Table 1. Preferred polypeptide fragments comprise: QADDLQATVAALCVLRGGGPWAG 
SWLSPKTPGAMGGDLVLGLGALRRRKRLL (SEQ NO: 61 8); or EQEKSLAGWALVLAXXGIGL 
MVLHAEMLWFGGCSAVNATGHLSDTLWLIPm^TlGYGDWPGTMWG^ 

1 0 TALLVAWARKLEFNKAEKHVHNFMMDIQYTKEMKESAARVL 

XHQRXlXAAINAFRQVRLKHRKLREQVNSMVDISKMHMILYDLQQh^ 
KLDALTELLSTALGPRQLPEPSQQSK (SEQ ID NO: 619), as well as N-terminal and C- 
terminal deletions. Also preferred are polynulcleotide fragments encoding these 
polypeptide fragments. 

15 This gene is expressed primarily in breast lymph node and T-cells, and to a 

lesser extent in placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematologic and 

20 inimune disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., lymphoid 

25 tissue, blood cells and placenta, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the. standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 

30 sequence shown in SEQ ID NO. 325 as residues: Arg-13 to Lys-23. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment/diagnosis of hematologic and 
diseases involving immune modulation based or distribution in the lymph node and T- 
cells. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

This gene was recently cloned by another group, calling it PAPS synethase. 
(See Accession No. e 1204 135.) Preferred polypeptide fragments comprise the^amino 
acid sequence YQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGK (SEQ ID NO: 6^0). 
5 Also preferred are the polynucleotide fragments encoding this polypeptide fragment. 

It has been discovered that this gene is expressed primarily in benign prostate 
hyperplasia. Human Umbilical Vein Endothelial Cells and to a lesser extent in smooth 
muscle and Human endometrial stromal cells-treated with estradiol. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: inflamation, ischemia, 
and restenosis, based on endothelial cell and smooth muscle cell expression, and 
prostate diseases such as benign prostate hyperplasia or prostate cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the prostate or vessels 
of the circulatory system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., prostate, endothelial 
cells, smooth muscle, and endometrium, and cancerous and wounded tissues) or bodily 

20 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO. 326 as residues: Arg-21 to Asp-26, Lys-35 to Lys-44, 

25 Glu-49 to Asn-58. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating/diagnosing diseases or conditions 
where the endothelial cell lining of the veins and arteries of underlying smooth muscle 
are involved. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

This gene is expressed primarily in human 6 week embryo and to a lesser extent 
in placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: developmental 
anomalies or fetal deficiencies. Similarly, polypeptides and antibodies directed to these 
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15 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly developmental in nature, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic 
5 tissue, and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
10 NO. 327 as residues Lys-50 to Glu-57. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for detection of developmental abnormalities. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

15 This gene is expressed primarily in kidney and amygdala and to a lesser extent 

in fetal tissues. This gene is mapped to chromosome 14, and therefore is useful in 
linkage analysis as a marker for chromosome 14. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) present in a biological sample and 

20 for diagnosis of diseases and conditions: kidney diseases, neurological disorders and 
developmental abnormalities. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s). For a number of disorders of the above tissues, particularly of the renal 
system or developing fetal tissues, expression of this gene at significantly higher or 

25 lower levels may be routinely detected in certain tissues and cell types (e.g., kidney, 

amygdala, and fetzil tissues, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment or diagnosis of conditions affecting 
the brain, kidneys and fetal development. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

This gene is expressed primarily in ovarian cancer. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: solid tumors similar to 
ovarian cancer Similarly, polypeptides and antibodies directed to these polypeptides are 
5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., ovarian and other 
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 329 as residues Ser-5 1 to Val-56. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the treatment of solid tumors of the 
reproductive system such as ovarian cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 
This gene is expressed primarily in brain medulloblastoma.. Preferred polypeptide 
fragments comprise the amino acid sequence: IRHEQHPNFSLEMHSKGSSLLLFLPQL 
ILILPVCAHLHEELNC (SEQ ID NO: 643) and SFFISEEKGHLLLQAERHPWVAGALVGVSG 
GLTLTTCSGPTEKPATKNYFLKRLLQEMHIRAN (SEQ ID NO: 644). as well as N-terminal 
and C-terminal deletions. Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors particularly of 
the CNS or Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the Central nervous system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., brain and other 
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., semm, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 

30 sample taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating meduUoblastoma or similar tumors. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

This gene is expressed primarily in adipocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions: obesity. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the adipose tissues 
expression of this gene at significantly higher or lower levels may be routinely detected 

15 in certain tissues and cell types (e.g., adipocytes and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

20 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating obesity by regulating the function and 
number of adipocytes 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

25 This gene is expressed primarily in B cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, of the immune system with an emphasis on B cell lymphoma. Similarly, 

30 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the tumors of the 
inmiune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., blood cells, and lymphoid 

35 tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of B cell derived 
5 tumors based on its expression in b cell lymphomas 

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

This gene is expressed primarily in immune cells and to a lesser extent in fetal 

tissues 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: inflanmiatory diseases 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 

15 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., cells of the inunune system, and fetal 
tissues, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

20 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:333 as residues Asp-10 to Pro-19, Ser-74 to Tyr-79, Glu-95 to Lys-110. 
The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for treatment of diseeises involving alterations in T 
cell activity. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

It has been discovered that this gene is expressed primarily in ovarian tumor. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors particularly of 
the ovary. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

35 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
tumors of the reproductive organs, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., ovarian 
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and other reproductive tissue and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO. 334 as residues: Leu-22 to Gln-27. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of ovarian tumors as it 
has only been identifled in ovarian tumors. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

It has been discovered that this gene is expressed primarily in fetal tissues and to 
a lesser extent in osteoclastoma cell line 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: osteoporosis or arthritis 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

20 skeletal expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., bone cells, and fetal tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

25 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of conditions of abnormal bone 
remodeling due to enhanced activity of osteoclasts. This may be useful as a specific 
marker for malignancies derived from osteoclasts or their precursors. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

The translation product of this gene shares sequence homology with a 
periplasmic ribonuclease which is thought to be important in degrading extracellular 
polynucleotides 

35 It has been discovered that this gene is expressed primarily in semm treated 

smooth muscle cells 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: vascular disease such as 
restenosis. Similarly, polypeptides and antibodies directed to these polypeptides are 

5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the vasculamre expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., smooth muscle, arid cancerous 
and wounded tissues) or bodily fluids (e.g., semm, plasma, urine, synovial fluid or 

10 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 336 as residues: 
Gln-30 to Lys-36, and Pro-4 1 to Arg-48. 

15 The tissue distribution and homology to ribonucleases indicate that 

polynucleotides and polypeptides corresponding to this gene are useful for treatment of 
pathological conditions of smooth muscle associated with bacterial or viral infiltration 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

20 This gene is expressed primarily in Early Stage Human Brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: human brain 
development and related diseases. Similarly, polypeptides and antibodies directed to 

25 these polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the human brain development and related diseases, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, and 

30 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to this gene indicate that polynucleotides 
35 and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases affecting human brain development and related diseases. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

It has been discovered that this gene is expressed primarily in human brain 

tissue.. 

5 Therefore, polynucleotides and polypeptides of the invention are usefiil as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: human brain diseases 
and other diseases related to brain diseases, which may be caused by-brain diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

10 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
human brain diseases, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

15 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell. sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 
20 and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
human brain diseases and other diseases related. , - - 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

It has been discovered that this gene is expressed primarily in Anergic T-cells. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune diseases, 
inflammatory diseases and diseases related to T lymph cells. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 

30 probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune diseases, 
inflammatory diseases and diseases related to T lymph cells, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 

35 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for immune diseases, 
5 inflanmiatory diseases and diseases related to T lymph cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

The translation product of this gene shares sequence homology with Shigella 

flexneri positive transcriptional regulator CriR (criR) gene which is thought to be 
10 importjmt in regulation of gene expression. 

This gene is expressed primarily in human synovial sarcoma and normal human 

brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions: human brain diseases 
particularly sarcomas of the synovium. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the human brain and synovium and other related human 

20 brain diseases, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain (e.g., synovial tissue, and brain and other tissue of the 
nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., semm, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

25 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. ~ 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of human synovial 
sarcoma and other related human brain diseases. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

This gene is expressed in bone marrow, infant brain, fetal liver and spleen, 
prostate and to a lesser extent in pineal gland, adipose tissue, kidney, adrenal gland, 
umbilical vein endothelial cells, and T cells. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of diseases and conditions: diseases related to bone marrow or 
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hematoplasUc tissues, prostate, kidney, adrenal gland, and cardiovascular tissue or 
organs. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing inimunological probes for identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the diseases related to 
5 hematoplastic tissues, inrunune system, prostate, kidney, adrenal gland, and 

cardiovascular tissue or organs, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., bone marrow, 
hematopoietic cells, pineal gland, adipose tissue, kidney, adrenal gland, endothelial 
cells, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 

10 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 

15 and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases related to hematoplastic tissues, inunune system, prostate, kidney, adrenal 
gland, and cardiovascular tissue or organs. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

20 This gene is expressed primarily in meningea and to a lesser extent in breast and 

adult brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Diseases of the 

25 meningea and related brain diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the meningea and related brain diseases, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 

30 tissues and cell types (e.g., miningea, mammary tissue, and brain and other tissue of 
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

35 disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of diseases of the 
meningea and related brain diseases. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

This gene is expressed in meningea, fetal spleen, osteoblast and to a lesser 
extent in activated T-cells, endometrial stromal cells, fetal lung, HL-60, thymus, testis 
and endothelial cells. ^ 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: meningeal disease, 
osteoporosis, immune diseases, and hematoplastic diseases. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for identification of the tissue(s) or cell type(s). For a number of disorders of the 

15 above tissues or cells, particularly of the meningeal diseases, osteoporosis, immune 
diseases, and hematoplastic diseases, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, endometrium, lung, thymus, testis, and endothelial cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

20 fluid) or another tissue or cell 'sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to gene indicate that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 

25 meningeal, osteoporosis, inmiune diseases, hematoplastic diseases, testis diseases and 
lung diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

This gene is expressed primarily in human thymus and to a much lesser extent 
30 in infant brain, T-cells, smooth muscle, endothelial cells, bone marrow, human ovarian 
tumor and keratinocytes testes, osteoclastoma, breast, and tonsils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Diseases involving the 
35 thymus, particularly thymic cancer and diseases involving T-cell maturation. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
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number of disorders of the above tissues or cells, particularly of the thymus, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., thymus, brain, and other tissue of the nervous system, 
blood cells, bone marrow, ovaries, and testes, and other reproductive tissue, manmiary 
5 tissue, tonsils, melanocytes and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
10 The tissue distribution and homology to gene indicate that polynucleotides and 

polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases of the thymus particularly thyrnic cancer and diseases involving T-cell 
maturation. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

This gene is expressed primarily in human tonsils, and placenta, and to a lesser 
extent in adipocytes, melanocyte, and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: inflammatory diseases, 
immune diseases, and obesity. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inflammatory diseases, immune diseases, and obesity, expression of 

25 this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., tonsils,- placenta, adipocytes, melanocytes, and brain and 
other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

30 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to this gene indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases such as inflammation, inmiune diseases, and obesity. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

This gene is expressed in activated T cells, and to a lesser extent in pituitary, 
testis, and breast lymph node. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: diseases relating to T 
cells. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
10 disorders of the inmiune system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., pituitary, testes 
and other reproductive tissue, mammary tissue, and lymphoid tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
15 disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of immune disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 35 
This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological disorders. 

25 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
diseases relating to neurological disorders, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 

30 brain, and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neurological disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

10 diseases relating to neurological disorders, expression of this gene at significantly 

higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

15 standard gene expression level, i.e., the expression level in healthy tissue or bodUy 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

This gene is expressed primarily in human ovary. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: ovarian cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
lype(s). For a number of disorders of the above tissues or cells, particularly of the 
ovarian disorders such as those involving germ cells, ovarian follicles, stromal cells, 

30 expression of this gene at significantly higher or lower levels may be routinely detected 
in cert£un tissues and cell types (e.g., ovary and other reproductive tissue, and 
cancerous and wounded tissues) or bodily fiuids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

35 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of ovariopathy. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

This gene is expressed primarily in lymph node breast cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the breast cancer, 
10 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., mammary tissue and lymphoid tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
15 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for used as a diagnostic marker for breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

20 This gene is expressed primarily in brain and to a lesser extent in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuronal disorders such 
as trauma, brain degeneration, and brain tumor. Similarly, polypeptides and antibodies 

25 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded 

30 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

35 corresponding to this gene are useful for diagnosis and therapeutic treatment of 
neuronal disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

This gene is expressed in early stage human embryo, adrenal gland tumor, and 
immune tissues such as fetal liver, fetal spleen. T-cell. and myoloid progenitor cell line 
and to a lesser extent in ovary, colon cancer, and a few orther tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumorigenesis including 
adrenal gland tumor, colon cancer and various other tumors, developmental and 
immune disorders. Similarly, polypeptides and antibodies directed to tiiese polypeptides 
are usefiil in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of Uie above tissues or cells 
particularly of the cancer tissues, early stage human tissues, and immune system 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g.. liver, spleen, blood cells, bone marrow, ovary 
and other reproductive tissue, and colon, and cancerous and wounded tissues) or bodily 
fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or anotiier tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e.. the expression. level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to tiiis gene are usefiil for diagnosis and therapeutic treatment of immune 
and developmental disorders, and tumorigenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in fetal lung, endothelial cells, liver, thymus 
and a few other immune tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune disorders such 
as immune deficiency and autoimmune diseases, pulmonary diseases. liver diseases, 
and tumor matasis. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells 
particularly of the fetal lung, liver, endothelial cells, and immune tissues, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
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tissues and cell types (e.g., lung, endothelial cells, liver, thymus, and other tissue of 
the immune system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
5 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of immune disorders and pulmonary 
and hepatic diseases. Its promoter may also be used for immune system and lung- 
10 specific gene therapies. The expression of this gene in endothelial cells indicates that it 
may also involve in angiogenesis which therefore may play role in tumor matasis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

This gene is expressed primarily in liver, thyroid, parathyroid and to a lesser 
1 5 extent in fetal lung, stomach and early embryos. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic regulation, 
obesity, heptic failure, heptacellular tumors or thyroiditis and thyroid tumors. Similarly, 
20 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the digestive/endocrine 
system expression of this gene at significandy higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., liver, thyroid, parathyroid, lung, 
25 stomach, and embryonic tissue, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
30 The tissue distribution and the extracellular locations indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for the detection 
and treatment of digestive/endocrine disorders, including metabohc regulation, heptic 
failure, malabsortion, gastritis and neoplasms. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

This gene is expressed primarily in Schizophrenic adult brain, pituitary, front 
cortex, hypothalmus and to a lesser extent in retina, adipose and stomach cancer and 
placenta. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: schizophrenia and other 
neurological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

10 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nerve system, expression of this gene at significandy higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., retinal 
tissue, adipose, stomach, and placenta, and cancerous and wounded tissues) or bodily 
fluids (e.g., semm^ plasma, urine, synovial fluid or spinal fluid) or another tissue or 

15 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in treatment/detection of disorders in the nerve 

20 system, including schizophrenia, neurodegeneration, and neoplasia. Additionally, a 
secreted protein in brain may serve as an endocrine. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares sequence homology with GTP 
25 binding proteins which are thought to be important in signal transduction and protein 
transport. 

This gene is expressed primarily in umbilical vein and microvascular endothelial 
cells, GM-CSF treated macrophage, anergic T cells, osteoblast, osteoclast, CD34+ cells 
and to a lesser extent in gall bladder. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: bone formation and 
growth, osteonecrosis, osteoporosis, angiogenesis and/or hematopoeisis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

35 inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the skeletal and 
hematopoeisis systems, expression of this gene at significantly higher or lower levels 



4SOOaD: <WO_9839448A^_> 



wo 98/39448 



32 



PCTAJS98/04493 



may be routinely detected in certain tissues and cell types (e.g., endothelial cells, blood 
cells, bone, and gall bladder, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
5 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to GTP binding proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for 
treatment/detection of bone formation and growth, osteonecrosis, osteoporosis, and/or 
10 hematopoeisis because its involvement in the growth signaling or angiogenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with signal 
sequence receptor gamma subunit which is thought to be important in protein 

15 translocation on endoplasmic reticulum. 

This gene is expressed primarily in adrenal gland, salivary gland, prostate, and 
to a lesser extent in endothelial cells and smooth muscle. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: protein secretion. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
secretory organs, expression of this gene at significantly higher or lower levels may be 

25 routinely detected in certain tissues and cell types (e.g., adrenal gland, salivary gland, 
prostate, endothelial cells, and smooth muscle, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution and homology to SSR gamma subunit indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for endocrine 
disorders, prostate cancer, xerostomia or sialorrhea. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed primarily in osteoclastoma cells and to a lesser extent in 
melanocyte, amygdala, brain, and stomach. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: ossification, 
osteoporosis, fracture, osteonecrosis, osteosarcoma. Similarly, polypeptides and 
5 antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the skeletal systems, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., melanocytes, amygdala, brain and other tissue of the nervous system, 
10 and stomach, and cancerous and wounded tissues) or bodily fluids (e.g., semm, 

plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful in intervention of ossification, osteoporosis, 
fracture, osteonecrosis and osteosarcoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

20 The translation product of this gene shares sequence homology with proline rich 

proteins which is thought to be important in protein-protein interaction. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: neurological and 
psychological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inmiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nerve system and endocrine system, expression of this gene at 

30 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

35 fluid from an individual not having the disorder. 

The tissue distribution and homology to proline-rich proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful in intervention 
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and detection of neurological diseases, including trauma, neoplasia, degenerative or 
metabolic conditions in the central nerve system. Additionally, the gene product may be 
a secreted by the brain as an endocrine. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

The translation product of this gene shares sequence homology with the AOCB 
gene from Aspergillus nidulans which is important in asexual development. 

This gene is expressed primarily in infant brain and to a lesser extent in the 
developing embryo, trachea tumors, B-cell lymphoma and synovial sarcoma. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurodegenerative 
diseases, leukemia and sarcoma's. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 

15 identification of the tissue(s) or cell type(s). For a number of disorders of the above 

tissues or cells, particularly of the brain and immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., embryonic tissue, blood cells, trachea, and synovial tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 

20 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in infant brain and sarcoma's and homology to a gene 
involved in a key step of eukaryotivc development (fungal spore formation) indicates 

25 that the protein product of this clone could play a role in neurological diseases such as 
schizophrenia, particularly in infants. The existence of the gene in a B-cell lymphoma . 
indicates the gene may be used in the treatment and detection of leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

30 This gene is expressed primarily in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary disorders 
including lung cancer. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the pulmonary system, expression of this gene at significantly higher or 
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35 

lower levels may be routinely detected in certain tissues and cell types (e.g,, lung, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution of this gene only in fetal lung indicates that it plays a key 
role in development of the pulmonary system. This would suggest that misregulation of 
the expression of this protein product in the adult could lead to lymphoma or sarcoma 
formation, particularly in the lung. It may also be involved in predisposition to certain 
10 pulmonary defects such as pulmonary edema and embolism, bronchitis and cystic 
fibrosis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in hematopoietic cell types and fetal cells and to 

15 a lesser extent in all tissue types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in the immune 
system and hematopoeisis. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and hematopoietic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, and fetal tissue, and cancerous and wounded tissues) 

25 or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, reladve to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution of this gene predominantly in hematopoeitic cells and in 

30 the developing embryo indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for detection and treatment of lymphomas and disease states 
affecting the immune system or hematopoeisis disorders such as leukemia, AIDS, 
arthritis and asthma.. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene is expressed primarily in prostate and to a lesser extent in fetal spleen, 
fetal liver, infant brain and T cell leukemias. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: prostate disorders, 
prostate cancer, leukemia. Similarly, polypeptides and antibodies directed to these 
5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, and/or prostate gland expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., thymus, spleen, liver, brain and other tissue of the nervous system, and 
10 blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution of this gene in prostate indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for detection or treatment of prostate 
disorders or prostate cancer. Its distribution in fetal liver and fetal spleen indicates it 
may play a role in the immune system and its misregulation could lead to immune 
disorders such as leukemia, arthritis and asthma. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

The translation product of this gene shares sequence homology with dynein. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neuro-degenerative 
diseases of the brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inmiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
30 particularly neuro-degenerative diseases expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
35 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
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The predominant tissue distribution in the brain and homology to dynein, a 
microtubule motor protein involved in the positioning of cellular organelles and 
molecules indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for detection/treatment of neurodegenerative diseases^ such as Alzheimers, 
5 Huntigtons, Parkinsons diseases and shizophrenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

The translation product of this gene shares sequence homology with ubiquitin- 

conjugation protein, an enzyme which is thought to be important in the processing of 
10 the Huntingtons Disease causing gene. 

This gene is expressed primarily in brain and to a lesser extent in activated 

macrophages. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions: neurodegenerative 

disease states including Huntington's disease. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of brain tissues. For a number of disorders of the above 
tissues or cells, particularly of the neurological systems expression of this gene at 

20 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and blood cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

25 in healthy tissue or bodily fluid from an individual not having the disorder. 

The predominant tissue distribution of this gene in the brain and its homology to 
a Huntington interacting protein indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the regulation of the expression of the 
Huntington disease gene and other neurodegenerative diseases including 

30 spinocerebullar ataxia types I and HI, dentatorubropallidoluysian and spinal bulbar 

muscular atrophy. In addition, the existence of elevated levels of free ubiquitin pools in 
Alzheimer's disease, Parkinson*s disease and amylotrophic lateral sclerosis indicates 
that the ubiquitin pathway of protein degradation plays a role in these disease states. 
Thus, considering the gene described here is homologous to a ubiquitin-conjugation 

35 protein it may play a general role in neurodegenarative conditions. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

This gene is expressed primarily in T-cells (anergic T-cells, resting T-Cells, 
apoptotic T-cells) and lymph node (breast), as well as brain (hypothalamus, 
hippocampus, pituitary, infant brain, early-stage brain). 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune (e.g, 
inununodeficiencies, autoimmunities, inflammation, leukemias & lymphomas) and 
neurological (e.g. Alzheimer's disease, dementia, schizophrenia) disorders. Similarly, 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the central nervous, 
hematopoietic and immune systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 

15 cells, lymphoid tissue, and brain and other tissue of the nervous system, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

20 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful in the intervention or detection of pathologies 
associated with the hematopoietic and immune systems, such as anemias (leukemieis). 
In addition, the expression in brain (including fetal) might suggest a role in 
developmental brain defects, neuro-degenerative diseases or behavioral abnomalities 

25 (e.g. schizophrenia, Alzheimer*s, dementia, depression, etc.). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in lung, and to a lesser extent in a variety of 
other hematological cell types (e.g. Raji cells, bone marrow cell line, activated 

30 monocytes). 

Therefore, polynucleotides £uid polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary and/or 
hematological disfunction. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the vasculo-pulmonary and hematopoietic systems, expression of this 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

This gene is expressed primarily in CD34 positive cells (Cord Blood). 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematopoieUc 
differentiation and immune disorders. Similarly, polypeptides and antibodies duected to 
these polypeptides are useful in providing immunological probes for differenU^ 
identification of the ussue(s) or cell type(s). For a number of disorders of the above 
10 tissues or cells, particularly of hematopoietic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certam Ussues 
and cell types (e.g., hematopoietic cells, and blood cells, and cancerous and wounded 
tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
15 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypepUdes 
couesponding to this gene are usefiil in the detection and treatment of condiUons 
associated with CD34.positive cells, and therefore as a marker for cell differentiation m 
20 hematapoiesis, as well as immunological disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

The translation product of the predicted open reading frame of this conUg has 
sequence identity to the murine gene designated Insulin-Like ^^^^^f 
25 Protein (IGFBP)-l as described by Lee and colleagues (Hepatology 19 (3), 656-665 

(1994)). 

This gene is expressed exclusively in hemangiopencytoma. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present m a 
30 biological sample and for diagnosis of hemangiopericytoma and other pencyte or 

endotllial cell proliferative disorders. Similarly, polypeptides and antibodies duected 
to these polypeptides are useful in providing immunological probes for differenUal 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the circulatory and immune systems, expression of this 
35 gene at significantly higher or lower levels may routinely be detected m certain Ussues 

nd cell t^pes (e.g., pericyte or endothelial cells, and liver, and cancerous ^d — ^ 
tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic, endocrine 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 

5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the endocrine and nervous system disorders and neoplasia, expression of this gene at 
significantiy higher or lower levels may be routinely detected in certain tissues and cell 
types (e,g., adrenal gland, kidney, brain and other tissue of the nervous system, 

10 pulmonary tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for study, treatment and diagnosis of neurological 
and endocrine disorders including neoplasia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

20 This gene is expressed primarily in human adipocytes, and to a lesser extent in 

spleen, 12-week old human, and testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: inmiune, metabolic and 

25 growth disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantiy higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., adipocytes, 

30 spleen, and testes and other reproductive tissue, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for study, diagnosis and treatment of immune, 
developmental and metabolic disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 64 
One translated product of this clone is homologous to the mouse zinc finger protein 
PZF. (See Accession No. 453376; see also Gene 152 (2), 233-238 (1995).) Preferred 
5 polypeptide fragments correspond to the highly conserved domains shared between 

mouse and man. For example, preferred polypeptide fragments comprise the amino acid 
sequence: LQCEICXjFTCRQKASLNWHMKKHDADSFifQFSCOTCGKKFEKK^ 
PEV (SEQ ID NO: 621); ITSTDILGTNPESLTQPSD (SEQ ID NO: 622); NSTSGECLLLEAEGM 
SKSY (SEQ ID NO: 623); CSGTERVSLMADGKIFVGSGSSGGTEGLVMNSDILGATTEVLIEDSD 

1 0 SAG? (SEQ ID NO: 624); IQYVRCEMEGCGTVLAHPRYLQHHIKYQHLLKKKYVCPHPSCGRLF 
RLQKQLLRHAKHHT (SEQ ID NO: 625); DQRDYICEYCARAFKSSHNLAVHRMIHTGEK (SEQ 
ID NO: 626); RSSRTSVSRHRDTENTRSSRSKTGSLQLICKSEPNTDQLDY (SEQ ID NO: 627); 
PFKDDPRDETYKPHLERETPKPRRKSG (SEQ ID NO: 630); QYVRCEMEGCGTVLAHPRYLQ 
HHIKYQHLLKKKYVCPHPSCGRLFRLQKQLLRHAKHHTD (SEQ ID NO: 629); Or residues 

15 1 5 1-1 82 of QRD YICEYCARAFKSSHNLAVHRMIHTGEKHY (SEQ ID NO: 628). Also preferred 
are polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in Rhabdomyosarcoma, melanocyte and colon 
cancer tissue and to a lesser extent in smooth muscle, pancreatic tumor, and apoptotic 
T-cells. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and hemopoetic, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., striated muscle, 
melanocytes, colon, smooth muscle, pancreas, and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

30 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis and treatment of cancer and 

35 hemopoetic disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

This gene is expressed primarily in human adipose and salivary gland ussue and 
to a lesser extent in human bone marrow and fetal kidney. 

Therefore. polynucleoUdes and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or ceU type(s) present m a 

biological sample and for diagnosis of diseases and conditions: metabolic and immune 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
usefiil in providing immunological probes for differential identification of the Ussue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
10 the metabolic and hemopoetic systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., adipose, 
salivary gland, bone marrow, and kidney, and cancerous and wounded ussues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to Ae 
15 standard gene expression level, i.e.. the expression level in healthy tissue orbodUy 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis of metabolic and immune 
disorders. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

This translated product of this gene was recentiy identified as oxytocinase splice vanant 
1 (See Accession Nos. 2209276 and dl010078.) Preferred polypeptide fragments 
comprise the amino acid sequence: eMFDSLSYFKGSSLLLMLKTYLSEDVFQHAVVLYLHN 
25 HSYASIQSDDLWDSFNEVTNQT1-DVKRMMKTWT1.QKGFPLVTVQKKGKELHQQERFFL^^ 
PEIQPSDTRYM (SEQ ID NO: 631). Also preferred are polynucleotide fragments encodmg 
this polypeptide fragment. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

30 This gene is expressed primarily in hemopoetic cells, particularly apoptotic T- 

cells, and to lesser extent in primary dendritic cells and adipose tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of apoptotic T-cells, primary denriuc ceUs. and 
adipose tissue present in a biological sample and for diagnosis of diseases and 
35 conditions: hemopoetic diseases including cancer and general immune disorders. 
Similarly polypeptides and antibodies directed to tiiese polypeptides are useful m 
providing immunological probes for differential identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the oral 
and intestinal mucosa as well as hemopoetic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., hematopoietic cells, and cancerous and wounded tissues) or bodily 
5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for treatment of diseases of the inunune system, 
including cancer, hemopoetic and infectious diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

This gene is expressed primarily in kidney cortex and to a lesser extent in infant 

15 brain, heart, uterus, and blood. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of kidney tissue present in a biological sample and 
for diagnosis of diseases and conditions: soft tissue cancer, inflammation, kidney 
fibrosis. Similarly, polypeptides and antibodies directed to these polypeptides are useful 

20 in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
nervous and endocrines systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., kidney, 
brain, and other nervous tissue, heart, uterus, and blood cells, and cancerous and 

25 wounded tissues) or bodily fluids (e.g., semm, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for study and treatment of cancer and fibroses. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The translation product of this gene shares strong sequence homology with 
vertebrate and invertebrate protein tyrosine phosphatases. 
35 This gene is expressed primarily in endometrial tumors, melanocytes, myeloid 

progenitors and to a lesser extent in infant breun, adipocytes, and several hematopoietic 
stem cells. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of transformed hematopoietic and epithelial cells 
present in a biological sample and for diagnosis of diseases and conditions which 
include, but are not limited to, of skin and endometrium, leukemia. Similarly, 
5 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the nervous and 
hemopoietic systems, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., endometrium, melanocytes, 

10 bone marrow, adipocytes, hematopoietic cells, and brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., semm, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

15 disorder. 

The tissue distribution and sequence similarity with tyrosine phosphatases 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
study and treatment of cancer and hematopoietic disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

This gene is expressed primarily in osteoclastoma, breast, and infant brain and 
to a lesser extent in various fetal and transformed bone, ovarian, and neuronal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: degenerative conditions 
of the brain and skeleton. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous and skeletal system, expression of this gene at significantly 

30 higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
bone, mammary tissue, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

35 in healthy tissue or bodily fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and treatment of degenerative, 
neurological and skeletal disorders. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

This gene was originally cloned from tumor cell lines. Recently another group 
has also cloned this gene, calling it the human malignant melanoma metastasis- 
suppressor (KiSS-1) gene. (See Accession No. U43527.) Preferred polypeptide 
fragments comprise the annino acid sequence: LEKVASVGNSRPTGQQLESLGLLA (SEQ ID 

10 NO: 632); VHREEASCYCQAEPSGDL (SEQ ID NO: 633); RPALRQAGGGTREPRQKRWAGL 

(SEQ ID NO: 634); and AVNFRPQRSQSM (SEQ ID NO: 635). Any frame shifts can easily be 
resolved using known molecular biology techniques. 

This gene is expressed primcirily in many types of carcinomas and to a lesser 
extent in many normal organs. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissues(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly melanomas, and other hyperproliferative disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of 
transformed organ tissue, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

25 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. As a tumor suppressor gene, increase 
amounts of the polypeptide can be used to treat patients having a particular cancer. 
The tissue distribution indicates that this gene and the translated product is 

30 useful for diagnosing and study of cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

This gene is expressed primarily in striatum and to a lesser extent in adipocytes 
and hemangioperiocytoma. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of striatal cells present in a biological sample and 
for diagnosis of diseases and conditions: neurological, fat and lysosomal storage 
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diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the nervous and immune systems, expression of this gene at significantly higher or 

5 lower levels may be routinely detected in certain tissues and cell types (e.g., striatal 
tissue, adipocytes, and vascular tissue, and cancerous and wounded tissues) or bodily 
fluids (e.g., semm, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

10 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis, study and treatment of 
neurodegenerative and growth disorders. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

This gene is expressed primarily in bone marrow stromal cells and to a lesser 
extent in smooth muscle, testes, endothelium, and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of bone marrow present in a biological sample and 

20 for diagnosis of diseases and conditions: connective tissue and hematopoietic diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
skeletal and hematopoietic systems, expression of this gene at significantly higher or 

25 lower levels may be routinely detected in certain tissues and cell types (e.g., bone 
marrow, stromal cells, smooth muscle, testes and other reproductive tissue, 
endothelium, brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

30 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis, and treatment of connective 
tissue and blood diseases. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

This gene is expressed primarily in brain, fetal liver and lung and to a lesser 
extent in retina, spinal chord, activated T-cells and endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of brain and regenerating liver present in a 

biological sample and for diagnosis of diseases and conditions: CNS and spinal chord 
injuries, inunune disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

10 particularly of the nervous and immune system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, liver, pulmonary tissue, blood cells, and 
endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

15 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and treatment of hematopoietic and 
20 neurological conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares sequence homology with GTP 
binding proteins (intracellular). 

This gene is expressed primarily in bone marrow, brain, and melanocytes and to 
a lesser extent in various endocrine and hematopoietic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematopietic and 
nervous system conditions. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a nurnber of disorders of the above tissues or cells, 
particularly of the nervous and immune, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., bone 
marrow, melanocytes, brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder. 



25 
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35 
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relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to nucleotide binding factors indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for study, 
5 diagnosis, and treatment of brain degenerative, skin and blood diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

This gene is expressed primarily in activated T-cells and to a lesser extent in 
retina, brain, and fetal bone. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of activated T-cells and developing brain present 
in a biological sample and for diagnosis of diseases and conditions: inwnune 
deficiencies and skeletal and neuronal growth disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

15 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the nervous, immune, and skeletomuscular 
sustems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., blood cells, brain and other tissue of the 
nervous system, retinal tissue, and bone, and cancerous and wounded tissues) or 

20 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) Or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for diagnosis, study and treatment of cancer, 
urogenital, and brain degenerative diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

This gene is expressed primarily in fetal liver, activated monocytes, osteoblasts 

30 and to a lesser extent in synovial, brain, and lymphoid tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of myeloid and lymphoid present in a biological 
sample and for diagnosis of diseases and conditions: inflammation, iiiunune 
deficiencies, cancer. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and skeleton, expression of this gene at significantly 
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higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
liver, blood cells, bone, synovial tissue, brain and other tissue of the nervous system, 
and lymphoid tissue, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis, and treatment of lymphoid 
and mesenchymal cancers and nervous system diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The translation product of this gene shares sequence homology with polymerase 
polyprotein precursor which is thought to be important in DNA repair and replication 

This gene is expressed primarily in infant brain and to a lesser extent in tumors 
and tumor cell lines 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, especially of the neural system and developing organs. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the neural system 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain (e.g., brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 
30 The tissue distribution and homology to polymerase polyprotein precursor 

indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of cancers especially of the netiral system and developing 
organs 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

This gene is expressed primarily in muscle and endothelial cells and to a lesser 
extent in brain. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: vascular diseases. 
Siniilarly, polypeptides and antibodies directed to these polypeptides are useful in 

5 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
vascular system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain (e.g., muscle, endothelial cells, brain and other tissue of the 
nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and pK>lypeptides 
15 corresponding to this gene are useful for treatment and diagnosis of disorders of the 
vascular and neural system including cardiovascular and endothelial. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

This gene is expressed primarily in placenta and to a lesser extent in fetal liver 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: developmental disorders 
and disorder of the haemopoietic system, fetal liver and placenta. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

25 immunologiccd probes for differential identification of the tissue(s) or cell type(s). For a 
number of diisorders of the above tissues or cells, particularly of developmental 
disorders and disorder of the haemopoietic system, fetal liver and placenta, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., placenta and liver, and cancerous and wounded tissues) or 

30 bodily fluids (e.g., semm, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

35 corresponding to this gene are useful for diagnosis and treatment of developmental 
disorders and disorders of the haemopoietic system, fetal liver and placenta. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

This gene is expressed primarily in bone marrow, placenta and tissues and 
organs of the hematopoietic system. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disorders of the bone 
and haemopoietic system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

10 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune, bone and hematopoietic system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., bone marrow, placenta, and hematopoietic cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

15 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of disorders of the 

20 inmiune, bone and hematopoietic system 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

The translation product of this gene shares sequence homology with secretory 
carrier membrane protein which is thought to be important in protein transport and 

25 export. Any frame shifts in coding sequence can be easily resolved using standard 
molecular biology techniques. Another group recently cloned this gene, calling it 
SCAMP. (See Accession No. 2232243.) 

This gene is expressed primarily in prostate, breast and spleen, and to a lesser 
extent in several other tissues and organs. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disorders of the breast 
prostate and spleen. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

35 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly disorders of the breast prostate and spleen, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
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types (e.g., prostate, mammary tissue, and spleen, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
5 fluid from an individual not having the disorder. 

The tissue distribution and homology to secretory carrier membrane protein 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of disorders of the breast, prostate and spleen. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

This gene is expressed primarily in developing organs and tissue like placenta 
and infant brain and to a lesser extent in developed organs and tissue like cerebellum 
and heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neurological diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
20 neural system, expression of this gene at significantly higher or lower levels may be 

routinely detected in certain tissues and cell types (e.g., placenta, heart, brain and other 
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
25 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of diseases of the 
neural system including neurological disorders and cancer. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of this gene shares sequence homology with ATPase 6 
in Trypanosoma brucei which is thought to be important in metabolism. 

This gene is expressed primarily in tumor and fetal tissues and to a lesser extent 
35 in melanocytes, kidney cortex, monocytes and ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions: metabolism disorders. 
Similarly, polyp)eptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the fetal 
5 systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., fetal tissues, melanocytes, kidney, blood 
cells, ovary and other tissue of the reproductive system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
10 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to ATPase indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for treatment and diagnosis of 
metabolism disorders, especially in fetal and tumor tissue growth. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this gene shares sequence homology with the 
immunoglobulin superfamily of proteins which are known to be important in immune 
response and inmiunity. 

20 This gene is expressed primarily in stromal cells, colon cancer, lung, amygdala, 

melanocyte and to a lesser extent in a v£iriety of other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects of stromal cell 

25 development and cancer. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the stromal cells, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., stromal cells, 

30 colon, lung, amygdala, and melanocytes, and cancerous and wounded tissues) or 
bodily fluids (e.g., senun, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

35 The tissue distribution and homology to immunoglobulin indicate that 

polynucleotides and polypeptides corresponding to this gene are useful for treatment 
and diagnosis of immune system disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

The translation product of this gene shares sequence homology with 

transcription iniation factor eIF-4 gamma which is thought to be important in gene 
5 transcription. 

This gene is expressed primarily in tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumorigenesis. 

10 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly in tumor 
tissues, expression of this gene at significantly higher or lower levels niay be routinely 
detected in certain tissues and cell types (e.g., endometrium and lung, and cancerous 

15 and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to transcription iniation factor eIF-4 

20 gamma indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for gene regulation in tumorigenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

The translation product of this gene shares sequence homology at low level in 
prolines with secreted basic proline-rich peptide II-2 which is thought to be important in 
protein structure or inhibiting hydroxyapatite formation in vitro. 

This gene is expressed primarily in endometrial tumor and fetal lung. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: endometrial tumors. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing innmunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
muscular/skeletal and reproductive systems, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
endometrium, and lung, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
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biological sample and for diagnosis of diseases and conditions: leukemia and 
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

5 particularly of the hematopoietic and immune systems, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, liver, spleen, bone, testes, and other reproductive 
tissue, brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal Huid) or 

1 0 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodUy 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of hematopoietic and 

15 immune disorders. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

The translation product of this gene shares weak sequence homology with 
mouse Gcapl protein which is develdpmentally regulated in brain. 
20 This gene is expressed primarily in infant and adult brain and fetal liver/spleen 

and to a lesser extent in smooth muscle, T cells, and a variety of other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological or 
25 hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous, hematopoietic, immune, and endocrine systems, expression 
of tills gene at significantiy higher or lower levels may be routinely detected in certain 
30 tissues and cell types (e.g., brain and other tissue of the nervous system, blood cells, 
liver, spleen ,and smooth muscle, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
35 individual not having the disorder. 
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The tissue distribution and its homology to Gcapl protein indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for treatubg and 
diagnosis of disorders in neuronal, hematopoietic, inunune, and endocrine systems. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

This gene is expressed primarily in brain and hematopoietic cells and to a lesser 
extent in tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions: disorder in nervous, 
hematopoietic, immune systems and tumorigenesis. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the in nervous, hematopoietic, inunune 

15 systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

20 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of disorders in the nervous, hematopoietic, and immune 
systems. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with 
neuroendocrine-specific protein A which is thought to be important in neurologic 
systems. 

30 This gene is expressed primarily in brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases <md conditions: neural disorders and 
degeneration disease. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central or peripheral nervous systems, expression of this gene at 
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significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to neuroendocrine-specific protein A 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment or diagnosis of neural disorders and degeneration disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

The translation product of this gene shares sequence homology with collagen- 
like protein and prolin-rich protein which are thought to be important in connective 
tissue function and tissue stmcture. 

This gene is expressed primarily in fetal liver/spleen and brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuronal or 
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous and hematopoietic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., liver, spleen, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovieil 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to collagen-like protein and proline-rich 
proteins indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for supporting brain and hematopoietic tissue function and diagnosis and 
treatment of disorders in these functions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

This gene is expressed primarily in embryonic tissues and tumor tissues. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the inmiune system (e.g., tumors), expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic 
tissue and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasmeu 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

This gene is expressed primarily in brain tumor, placenta^ and melanoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: brain tumor or 
melanoma. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inmiunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the brain or melanocytes, expression of this gene at significantly higher or lower levels . 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, placenta, and melanocytes, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the translation product of this gene is useful 
in the diagnosis and treatment of brain tumors and melanoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

The translation product of this gene shares sequence homology with a yeast 
membrane protein, SUR4, which encodes for APAl that acts on a glucose -signaUng 
pathway that controls the expression of several genes that are transcriptionally regulated 
by glucose. 
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This gene is expressed primarily in fetal liver, and to a lesser extent in placenta 
and breast tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biologiczil sample and for diagnosis of diseases and conditions: defects of fetal liver or 
defects of glucose-regulated ATPase activities in tissues. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the fetal immune/hematopoietic system, 

10 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, placenta, and mammary tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

15 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to yeast SUR4 membrane protein indicate 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of defects of fetal liver or defects of glucose-regulated ATPase 
activities. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

This gene is expressed primarily in fetal liver, brain, and amniotic fluid. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: defects of the fetal 

immune system and adult brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the fetal immune system and adult brain, expression of this gene at 

30 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., liver, and brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., amniotic fluid, serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken fix>m an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

35 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for detecting defects of the fetal immune and hematopoietic systems since fetal liver is 
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the predominant organ responsible for hematopoiesis in the fetus. In addition, the gene 
product of this gene is thought to be useful for detecting certain neurological defects of 
the brain. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 98 

The translation product of this gene shares sequence homology with an yolk 
protein precursor, Vitellogenin which is thought to be important in binding lipids such 
as phosvitin. 

This gene is expressed primarily in amniotic cells and fetal liver. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in amniotic 
cells, fetal liver development and the fetal immune system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

15 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the [insert system where a related disease 
state is likely, e.g., immune], expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., amniotic cells, 
and liver, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

20 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to vitellogenin indicate that the protein 
25 product of this clone is useful for treatment and diagnosis of defects in amniotic cells, 
fetal liver development and the fetal immune system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 99 

This gene is expressed primarily in placenta, endometrial tumor, osteosarcoma 
30 and stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumor of the 
endometrium or bone, and osteosarcoma. Similarly, polypeptides and antibodies 
35 directed to these polypeptides are useful in providing inununological probes for 

differential identification of the tissue($) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the obstetric system (e.g. placenta. 
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endometrium) and the bones, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., placenta, 
endometrium, bone, and stromal cells, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spina] fluid) or another tissue or 
5 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors and 
10 abnormalities of the endometrium, and the bones because of its abundance in the 
aforementioned tissues.. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 100 

This gene is expressed primarily in hepatocellular tumor. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatocellular tumor. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identiflcation of the tissue(s) or cell 

20 type(s). For a number of disorders of the above tissues or cells, particularly of the liver, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

25 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of hepatocellular cancer because of its abundant expression 
in this tissue. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 101 

This gene is expressed primarily in Corpus Colosum, fetal lung and infant 

brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: defects of the Corpus 
Colosum or defects of the fetal lung. Similarly, polypeptides and antibodies directed to 
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these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the Corpus Colosum and brain in general, and fetal lung, 
expression of this gene at significantly higher or lower levels may be routinely detected 
5 in certain tissues and cell types (e.g., lung, and brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.» 
the expression level in healthy tissue or bodily fluid from an individual not having the 
10 disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of defects of the Corpus Colosum and brain in general, and 
defects of fetal lung. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 102 

This gene is expressed primarily in T cells and stromal cells, and to a lesser 
extent in adrenal gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: defects of T cell 

immunity and stromal cell development. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 

25 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, stromal cells, and adrenal gland, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of defects of T cell immunity and stromal cell development 
because of its abundant expression in these tissues. 



35 FEATURES OF PROTEIN ENCODED BY GENE NO: 103 

This gene is expressed primarily in infant brain and placenta. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 106 

This gene is expressed primarily in fetal liver/spleen, and to a lesser extent in 
acdvated T-Cells. 

Therefore, polynucleotides and polypeptides of the invendon are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and condidons: inimune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

10 immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., liver, spleen, and blood cells, 
and cancerous and wounded tissues) or bodily fluids (e.g., semm, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 

15 expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of immune disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 107 

This gene is expressed primarily in human embryo and to a lesser extent in 
spleen and chronic lymphocytic leukemia. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: leukemia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune or 
hemopoietic systems, expression of this gene at significantly higher or lower levels may 

30 be routinely detected in certain tissues and cell types (e.g., embryonic tissue, spleen, 
and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., semm, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

35 disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for the diagnosis and treatment of leukemia. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 108 

This gene is expressed primarily in placenta, and to a lesser extent in early stage 
human brain and in lung. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: fetal developmental 
abnormalities. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inununological probes for differential identification of the tissue(s) 

10 or cell type(s). For a number of disorders of the above tissues or cells, particularly in 
fetal and amniotic tissue, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., placenta, brain and 
other tissue of the nervous system, and lung, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

15 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this is useful for 
production of growth factor(s) associated with fetal development. Preferred 

20 polypeptides comprise the full-length polypeptide shown in the sequence listing, 
truncated however, at the amino terminus and beginning with QTIE. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 109 

This gene is expressed primarily in fetal spleen, and to a lesser extent in B-Cell 

25 lymphoma and T-Cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: lymphoma. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

30 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., spleen and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

35 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 
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The tissue distribution indicates that the protein product of this clone is useful 
for the treatment and diagnosis of human lymphomas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 110 
5 The translation product of this gene shares sequence homology with sarcoma 

amplified sequence (SAS), a tetraspan receptor which is thought to be important in 
malignant fibrous histiocytoma and liposarcoma. 

This gene is expressed primarily in human osteoclastoma, and to a lesser extent 
in pineal gland and infant brain. 

10 Therefore, polyniicleotides and polypeptides of the invention are useful as 

reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: malignant fibrous 
histiocytoma and liposarcoma. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., bone, 
pineal gland, and brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

20 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to sarcoma amplified sequence (S AS) 
indicate that the protein product of this clone is useful for treatment of, osteosarcoma, 

25 malignant fibrous histiocytoma and liposarcoma and related cancers, particularly 
sarcomas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 111 

The translation product of this gene shares sequence homology with 6.8K 
30 proteolipid protein, mitochondrial - bovine. 

This gene is expressed primarily in Wilm*s tumor and to a lesser extent in 

cerebellum and placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: Wilm's tumor. 

Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

providing inmiunological probes for differential identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune or renal systems, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and placenta, and cancerous and wounded tissues) or bodily fluids 

5 (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to 6.8K proteolipid protein indicate that 

10 the protein product of this clone is useful for diagnostic and therapeutics associated with 
tumors, particularly Wilm's tumor disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 112 

This gene is expressed primarily in embryonic tissue and to a lesser extent in 

15 osteoblasts, endothelial cells, macrophages (GM-CSF treated), and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: inunune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., embryonic tissue, bone, 
endothelial cells, blood cells and bone marrow, and cancerous and wounded tissues) or 

25 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for treatment or diagnosis of immune disorders. 
Preferred polypeptides encoded by this gene comprise the following amino acid 
sequence: MITDVQLAIFANMLGVSLFLLWLYHYVAVNNPKKQE (SEQ ID NO: 636). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 113 
35 This gene is expressed primarily in hepatocellular tumor, and to a lesser extent 

in fetal liver/spleen. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors, particularly 
hepatocellular tumors. Similarly, polypeptides and antibodies directed to these 
5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., liver, and 
spleen, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
10 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
15 for diagnosis and treatment of tumors, particularly hepatocellular tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 114 

The translation product of this gene exhibits a very high degree of sequence 
identity with the human Pig8 gene which is thought to be important in p53 mediated 

20 apoptosis. The sequence of this gene has since been published by Polyak and 

colleagues (Nature 389, 300-306 (1997)). In addition, the predicted translation product 
of this contig exhibits very high sequence homology with a murine gene denoted as 
EI24 which is also thought to be important in p53 mediated apoptosis. 

This gene is expressed primarily in infant brain and activated T-cells and to a 

25 lesser extent in bone marrow, fetal liver, and prostate. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, and tissue damage by radiation and anti-cancer dmgs. Similarly, 

30 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the nervous and 
immune systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., brain and other tissue of the 

35 nervous system, blood cells, bone marrow, liver, and prostate, and cancerous and 

wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
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relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to human Pig8 and murine EI24 genes 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
5 preventing apoptosis in patients being treated with anti-oncogenic drugs such as 

etoposide, hydroperoxycyclophosphamide, and X-irradiation, since this protein product 
is upregulated in cells undergoing such treatment where p53 was overexpressed. It may 
also be useful in the treatment of hematopoietic disorders and in boosting numbers of 
hematopoietic stem cells by interfering with the apoptosis of progenitor cells. The 

10 mature polypeptide is predicted to comprise the following amino acid sequence: 
EEMADSVKTFLQDLARGIKDSIWGICTISKLDARIQQKREEQRRRRASSVIJVQRI^ 
EPRWSRIFQCCAWNGGVFWFSLLLFi^RVnPVLQSVTARIIGDPSLHGDVWSWLEFFLTS^ 
LWVLPLFVLSKVVNAIWQDIADLAFEVSGRKPHPFPSVSKIIADMLFhO-LLQALR-IQGMF^ 
FPIHLVGQLVSLLHMSLLYSLYCFEYRWFNKGIEMHQRLSNIERNWPYYFGFGLPLAFLTAMQ 

15 SSYIISGCLFSILFPLniSAhnEAKTPGKAYLFQLRLFSLVVFLSNRLFHKTVYLQS>^^ 

FPSPHPSPAKLKATAGH (SEQ ID NO: 637). Accordingly, polypeptides comprising the 
foregoing amino acid sequence are provided as are polynucleotides encoded such 
polypeptides. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 115 

This gene is expressed primarily in stromal cells and to a lesser extent in 
multiple sclerosis. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: affecting the nervous 

system. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
nervous system, expression of this gene at significantly higher or lower levels may be 

30 routinely detected in certain tissues and cell types (e.g., stromal cells and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment and diagnosis of multiple sclerosis 
and other autoinunune diseases. 
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an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to the dystrophin gene indicate that 
5 polynucleotides and polypeptides corresponding to this gene are useful for diseases 
related the degenerative myopathies that are characterized by the weakness and atrophy 
of muscles without neural degradation; such as Duchenne and Becker's muscular 
dystropies. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 118 

This gene is expressed primarily in olfactory tissue and to a lesser extent in 
cartilage. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions: connective tissue 
diseases; chondrosarcoma. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the connective tissue, expression of this gene at significantly higher or 

20 lower levels may be routinely detected in certeiin tissues and cell types (e.g., olfactory 
tissue and cartilage, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from £in individual not having the 

25 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for tumors of connective tissues, osteoarthritis 
and the treatment and diagnosis of chondrosarcoma. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 119 

This gene is expressed primarily in Activated Neutrophils and to a lesser extent 
in fetal spleen, and CD34 positive cells from cord blood. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: allergies, defects in 
hematopoiesis and inflammation. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
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identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system and hematopoiesis system the, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., blood cells, and spleen, and cancerous and 
5 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for reducing the allergic effects felt by allergy 
suffers by neutralizing the activity of the immune system, especially since neutrophils 
are abundant in persons suffering from allergies and other inflammatory conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 120 
15 The translation product of this gene shares sequence homology with poly A 

binding protein n which is thought to be important in RNA binding for transcription of 
RN A to DN A 

This gene is expressed primarily in colon and to a lesser extent in brain and 
inunune system. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: colon cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 

25 number of disorders of the above tissues or cells, particularly of the inmiune and 

digestive system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., colon, tissue and cells of the 
immune system, and brain or other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

30 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to poly A binding protein II indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for detection 

35 and treatment of colon cancer and other disorders of the digestive system.. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 121 

The translation product of this gene shares sequence homology with thymidine 
diphosphoglucose 4.6 dehydrase which is thought to be important in the metabohsm of 

SUgJU*. 

5 This gene is expressed primarily in fetal liver and spleen and to a lesser extent in 

infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diabetes. Similarly, 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the endocrine system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certeiin tissues and cell types (e.g., liver, spleen, and brain and other tissue of the 

15 nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

20 The tissue distribution and homology to thymidine diphospoglucose 4.6 

dehydrase indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for treatment of persons with diabetes since it appears that this protein is needed 
in the metabolism of sugar in to its more basic components. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 122 

The translation product of this gene shares sequence homology with 
ceruloplasmin which is thought to be important in the metabolism and transport of iron 
and copper. Ceruloplasmin also contains domains with homology to clotting factors V 
and Vni. Defects in the circulating levels of ceruloplasmin (aceruloplasminenua) have 
30 been associated with certain disease conditions such as Wilson disease, and the 
accompanying hepatolenticular degeneration. 

This gene is expressed primarily in brain and retina and to a lesser extent in 
endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: diseases marked by 
defects in iron metabolism; aceruloplasminemia not characterized by defects in the 
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known ceruloplasnndn gene locus; nonclassical Wilson disease; movement disorders; 
and tumors derived from a brain tissue origin. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
5 the above tissues or cells, particularly of the brain, retina, and nervous system, 

expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, 
retinal tissue, and endothelial cells, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to ceruloplasmin indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment of 
patients with aceruloplasminemia, or other defects in iron and/or copper metabolism. 
Mutations in this locus could also be diagnostic for patients currently experiencing or 
predicted to experience aceruloplasminemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 123 

This gene is expressed primarily in brain and B cell lymphoma and to a lesser 
extent in fetal liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: B cell lymphorna; 
tumors and diseases of the brain and/or spleen; hematopoietic defects. Similarly, 
polypeptides and antibodies directed to these polypeptides are^ useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the brain and 
hematopoietic system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., bmin and other tissue of 
the nervous system, blood cells, liver, and spleen, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of disorders in neuronal. 
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hematopoietic, and immune systems. It could potentially be useful for 
neurodegenerative disorders and neuronal and/or hematopoietic cell survival or 
proliferation. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 124 

This gene is expressed primarily in osteoclastoma, dermatofibrosarcoma, and B 
cell lymphoma and to a lesser extent in endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer in particular osteoclastoma, dermatofibrosarcoma, and B cell 
lymphoma. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing inununological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

15 the bone, immune, and circulatory system, expression of this gene at significantly 

higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
bone, epidermis, blood cells, and endothelial cells, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

20 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancers and 
lymphoma; osteoporosis; and the control of cell proliferation and/or differentiation. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 125 

This gene is expressed primarily in immune tissues and hematopoietic cells, 
particularly in activated T cells and neutrophils, spleen, and fetal liver, and to a lesser 
extent in infant adrenal gland. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in T cell 
activation; hematopoietic disorders; tumors of a hematopoietic and/or adrenal gland 
origin. Similarly, polypeptides and antibodies directed to these polypeptides are useful 

35 in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
hematopoietic and/or endocrine systems, expression of this gene at significantly higher 
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or lower levels may be routinely detected in certain tissues and cell types (e.g., cells and 
tissues of the immune system, hematopoietic cells, blood cells, liver, and adrenal gland, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
5 having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for immune and/or hematopoietic disorders; 
10 diseases related to proliferation and/or differentiation of hematopoietic cells; defects in T 
cell and neutrophil activation and responsiveness; and endocrine and/or metabolic 
disorders, particularly of early childhood. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 126 

15 This gene is expressed primarily in placenta and endothelial cells and to a lesser 

extent in melanocytes and embryonic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial 

20 cell origin; angiogenesis associated with tumor development and metastasis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the vascular system 
and developing embryo, expression of this gene at significantly higher or lower levels 

25 may be routinely detected in certain tissues and cell types (e.g., placenta, endothelial 
cells, melanocytes, and embryonic tissues, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of developmental disorders; 
inhibition of angiogenesis; and vascular patterning. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 127 

This gene is expressed primarily in endothelial cells and hematopoietic tissues, 
including spleen, tonsils, leukocytes, and both B- and T-cell lymphomas. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial 
cell and/or hematopoietic origin; leukemias and lymphomas. Similarly, polypeptides 

5 and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune and vascular systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., endothelial cells, hematopoietic cells, spleen, 

10 tonsils, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for thie manipulation of angiogenesis; the 
differentiation and morphogenesis of endothelial cells; the proliferation and/or 
differentiation of hematopoietic cells; and the commitment of hematopoietic cells to 
distinct cell lineages. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 128 

This gene is expressed primarily in kidney medulla and to a lesser extent in 
spleen from chronic myelogenous leukemia patients, prostate cancer, and some other 
tissues. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of a kidney 
origin; chromic myelogenous leukemia; prostate cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

30 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the kidney and spleen, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., kidney, spleen, and prostate, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

35 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of kidney 
disorders and cancer, particularly chromic myelogenous leukemia and prostate cancer. 
It may also be useful for the enhancement of kidney tubule regeneradon in the treatment 
5 of acute renal failure. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 129 

This gene is expressed primarily in adult and infant brain and to a lesser extent 
in mesenchymal or fibroblast cells, as well as tissues with a mesenchymal origin. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of a brain and/or 
mesenchymal origin; neurodegenerative disorders; cancer; fibrosis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the brain and of 
mesenchymal cells and tissues, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., brain and other 
tissue of the nervous system and cancerous and wounded tissues) or bodily fluids 

20 (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 

sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for the diagnosis of tumors of a brain and/or 

mesenchymal origin; neurodegenerative disorders; cancer; and fibrosis, based upon the 
expression of this gene within those tissues. Fibrosis is considered as mesenchymal 
cells and fibroblasts are the primary cellular targets involved in this pathological 
condition. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 130 

This gene is expressed primarily in hepatocellular cancer and to a lesser extent in 

fetal tissues as well as testes tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: liver cancer. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing 



ISOOCID: <WO_9839448A2_L> 



wo 98/39448 



82 



PCT/US98/04493 



immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the digestive system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, fetal tissue, and testes and other 
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of liver cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 131 

This gene is expressed only in infant early brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: development and 
diseases of the nervous system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identiflcation 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the brain and nervous system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating diseases of the brain in children and in 
treating nervous system disorders such as Alzheimer's disease, schizophrenia, 
dementia, depression, etc. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 132 

This gene is expressed primarily in brain and to a lesser extent in glioblastoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Alzheinier's disease. 
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schizophrenia, depression, mania, and dementia. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain and nervous system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, and cancerous 
and wounded tissues) or bodily fluids (e.g., senim, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating brain disorders such as Alzheimer's 
disease, schizophrenia, depression, mania, and dementia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 133 

The translation product of this gene shares sequence homology with ribitol 
dehydrogenase of bacteria which is thought to be important in metabolism of sugars. 

This gene is expressed primarily in macrophage and to a lesser extent in T-cell 
lymphoma and lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tissue destruction in 
inflammation. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., blood cells and lung, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to ribitol dehydrogenase indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for altering 
macrophage metabolism in diseases such as inflammation where macrophages are 
causing excess tissue destruction. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 134 

This gene is expressed primarily in pancreatic tumor and to a lesser extent in 
synovial sarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological scunple and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

10 the endocrine and connective tissue systems, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and ceU types (e.g., 
pancreas, and synovial tissue, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum» plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 

15 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and poljfjjeptides 
corresponding to this gene are useful for treating and diagnosing various cancers. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 135 

This gene is expressed primarily in T cell lines such as Raji and to a lesser 
extent in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: immune system 

disorders and inflammation. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantly higher or 

30 lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

35 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating and diagnosing inflammatory diseases 
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such as rheumatoid arthritis, sepsis, inflammatory bowel disease, and psoriasis, as well 
as neutropenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 136 

5 The translation product of this gene shares high sequence homology with SARI 

subfamily of GTP-binding proteins which is thought to be important in vesicular 
transport in mammalian cells. 

This gene is expressed primarily in serum-stimulated smooth muscle cells and to 
a lesser extent in a T-cell lymphoma. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diseases affecting 
vesicular transport. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the muscular system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and smooth muscle, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 

20 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individufd not having the disorder. 

The tissue distribution and homology to GTP-binding proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for gene therapy 

25 in treating the large number of diseases involved in defective vesicular transport within 
cells.. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 137 

The translation product of this gene shares sequence homology with a protein 
30 found in C elegans cosmid F25B5. 

This gene is expressed primarily in a fetal tissues and to a lesser extent in 
melanocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: abnormal fetal 
development, especially of the pulmonary system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
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for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the fetal pulmonary system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., fetal tissue, pulmonary tissue, and melanocytes, and 
5 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for treatment and diagnosis of diseases affecting 
the pulmonary system, such as emphysema. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 138 

This gene is expressed primarily in gall bladder and to a lesser extent in smooth 
15 muscle. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: digestive system disease 
and gall bladder problems. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the digestive system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., gall 
bladder and tissue of the digestive system, and smooth muscle, and cancerous and 

25 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for treating diseases of the digestive system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 139 

This gene is expressed primarily in placenta and to a lesser extent in brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: abnormal fetal 
development. Similarly, p>olypeptides and antibodies directed to these polypeptides are 
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useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
developing tissues, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., placenta, and brain and other 
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating and diagnosing abnormal fetal 
development. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 140 

This gene is expressed primarily in smooth muscle and to a lesser extent in 
ovary, prostate cancer, and activated monocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hypertension and 
atherosclerosis. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the circulatory system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., smck>th 
muscle, ovary and other reproductive tissue, prostate, and blood cells, and cancerous 
and wounded tissues) or bodily fluids (e.g., semm, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating diseases of the circulatory system, 
such as hypertension, atherosclerosis, etc. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 141 

This gene is expressed primarily in fetal spleen and to a lesser extent in placenta 
and bone marrow. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: anemia and other 
diseases affecting blood cells. Similarly, polypeptides and antibodies directed to these 

5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For i^umber of disorders of the above tissues or cells, 
particularly of the circulatory and pulmonary systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., spleen, placenta, bone marrow, and blood cells, and cancerous and 

10 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

1 5 corresponding to this gene are useful for the generation of red and white blood cells and 
for the diagnosis of disease of these cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 142 

The predicted translation product of this contig is a human homolog of the 
20 murine tetracycline/sugar transporter molecule recently reported by Matsuo and 
colleagues (Biochem. Biophys. Res. Commun. 238 (1), 126-129 (1997)). 

This gene is expressed primarily in synovium and to a lesser extent in 
endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: rheumatoid arthritis and 
inflanunation. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
30 the immune and lymphatic systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., synovial 
tissue, and endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
35 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of inflammatory 
diseases, such as rheumatoid arthritis, leukemia, neuu-openia, inflammatory bowel 
disease, psoriasis, sepsis, and the like. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 143 

This gene is expressed primarily in placenta and to a lesser extent in melanocyte, 
fetal liver and spleen, and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: abnoraial early 
development. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, lower levels 

15 may be routinely detected in certain tissues and cell types (e.g., placenta, melanocytes, 
liver, spleen, and bone marrow, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

20 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of abnormal early 
development phenomena and diseases. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 144 
This gene is expressed primarily in fetal liver and spleen. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: anemia and neutropenia, 

30 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and blood systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., liver and spleen, 

35 and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
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expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in hematopoeisis and bone marrow regeneration 
5 as it is most abundant in fetal tissues responsible for the generation of hematopoeitic 
cells. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 145 

The translation product of this gene shares sequence homology with protein 

10 tyrosine phosphatase which is thought to be important in transducing signal to acUvate 
cells such as T cell, B cell and other cell types. 

This gene is expressed primarily in T cells and tissues in early stages of 
development and to a lesser extent in cancers. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

1 5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immuno-related 
diseases and cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or celis, 

20 particularly of the immune system expression of this gene at significantiy higher or 

lower levels may be routinely detected in certain tissues and cell types (e.g.. embryonic 
and fetal tissue, undifferentiated cells, and blood cells, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

25 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to the protein tyrosine phosphatase family 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
modulating the immune system. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 146 

This gene is expressed primarily in T cell and to a lesser extent in B cell, 

macrophages and tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: immuno-disorders. 

Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
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providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain Ussues and cell types (e.g., blood cells, and cancerous and 
5 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
1 0 corresponding to this gene are useful for regulating the inunune system therefore can be 
used in treating diseases such as autoimmune diseases and cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 147 

This gene is expressed primarily in placenta and to a lesser extent in endothelial 

1 5 cells, testis tumor, ovarian cancer, uterine cancer. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to cancer. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or ceU type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significanUy higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., placenta, 
endothelial cells, testis and ovary and other reproductive tissue, and cancerous and 

25 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for diagnosis arid treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 148 

This sequence has significant homology to mouse torsin A. Recently, another 
group cloned the human Torsin A gene. (See, Accession No. 2358279; see also Nature 
35 Genet. 17, 40-48 (1997).) 

This gene is expressed primarily in osteoclastoma, T-cell, and placenta and to a 
lesser extent in fetal lung, fetal liver, fetal brain, adult brain and tumor Ussues 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disease conditions in 
hematopoiesis and cancers. Similarly, polypeptides and antibodies directed to these 
5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hematopoiesis system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, bone, placenta, lung, liver, and brain and other tissues of the nervous system, 
10 and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating blood related diseases such as 
deficiencies in red blood cell, white blood cell, platelet and other hematopoiesis cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 149 

20 This gene is expressed primarily in T cell, prostate and prostate cancer, 

endothelial cells and to a lesser extent in monocyte, dendritic cell, bone marrow, 
salivary gland, colon cancer, stomach cancer, pancreatic tumor, uterine cancer, fetal 
spleen and osteoclastoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immuno-related 
diseases and cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

30 particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, prostate, endothelial cells, dendritic cells, bone marrow, salivary gland, colon, 
stomach, pancreas, utems, spleen and bone, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

35 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 150 
5 This gene was recently cloned by another group, calling it eIF3-p66. (See 

Accession No. 2351378.) This gene plays a role in RNA binding and macromolecular 
assembly, and therefore, any mutations in this gene would likely result in a diseased 
phenotype. Preferred polypeptide fragments comprise the amino acid sequence: 
MAKFMTPVIQDNPSGWGPCAVPEQFRDMPYQPFSKGDRLGKVADWTGATYQDKRYTNKYSS 

1 0 QFGGGSQYAYFHEEDESSFQLVDTAJlTQKTAYQRNRMRFAQR^a-RRDKDRR^^ 

KSAKQKERERIRLQKKFQKQFGVRQKWDQKSQKPRDSSVEVRSDWEVKEEMDFPQLMKMRY 
LEVSEPQDIECCGALEYYDKAFDRITTTISEKPLRXXKRIFHTVTT^ 
AILATU^SCTRSVYSWDIWQRVGSKLFFDKRDNSDFDLLTVSETANEPPQDE^ 
AMEATYINHNFSQQCLRMGKERYNFPNPNPFVEDDMDK^^EIASVAYRYRSG^^ 

1 5 EHDGVMTGANGEVSFI^^KTL^nEWDSRHCNGVDWRQKI.DSQRGAVlATELK^ 

CALLAGSEYLKLGYVSRYHVKDSSRHVILGTQQFKPNEFASQINLSVENAWGILRCVIDICM^ 
EEGKYLILKDPNKQVIRVYSLPDGTFSS (SEQIDNO: 638). as well as N-terminal and C- 
terminal deletions of this polypeptide fragment. 

This gene is expressed primarily in T cell, bone marrow, embryo and 

20 endothelial cells and to a lesser extent in testis tumor and endometrial tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune diseases and 
tumors. Similarly, polypeptides and antibodies directed to these polypeptides are useful 

25 in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system and reproductive system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

30 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for immune disorders and cancers. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 151 

This gene is expressed primarily in testis and to a lesser extent in T cell, spinal 
cord, placenta, neutrophil and monocyte. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: male reproductive and 
endocrine disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

10 particularly of the reproductive, immune and endocrine systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., testis and other reproductive tissue, blood cells, tissue of the 
nervous system, and placenta, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 

15 sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for regulating immune and reproductive functions. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 152 

The translation product of this gene shares sequence homology with tyrosyl- 
tRNA synthetase which is thought to be important in cell growth. 

This gene is expressed primarily in brain, liver, keratinocytes, tonsils, and 

25 heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer autoimmune diseases. Similarly, polypeptides and antibodies 

30 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, liver, keratinocytes, tonsils, heart 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissues of the nervous system, 

35 liver, keratinocytes, tonsils and heart, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
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gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to tyrosyl-tRN A synthetase indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for modulating 
5 cell growth. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 153 

This gene is homologous to the Drosophila transcriptional regulator dre4. (See 
Accession No. 251 1745,) Dre4 is a gene required for steroidogenesis in Drosophila 

10 melanogaster and encodes a developmentally expressed homologue of the yeast 

transcriptional regulator CDC68. Preferred polypeptide fragments comprise the amino 
acid sequence: KKRHTDVQFYTEVGEriTDLGKHQHMHDRDDLYAEQMEREMRHKL^ 
FmKVEALTKEELEFEVPFRDLGFNGAPYRSTCLLQPTSSALVNATEWPPFV\ril.DE\^ 
VQFHLKNFDMVrV^KDYSKKVTMINAIPVASLDPIKEWLNSCDLKYTC 

15 DPEGFFEQGGWSFL (SEQ ID NO: 639), as well as N-terminal and C-terminal deletions of 
this fragments. Also preferred are polynucleotide fragments encoding this polypeptide 
fragment. 

This gene is expressed primarily in fetal liver, spleen, placenta, lung, T cell, 
thyroid, testes. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: brain tumor, heart and 
liver diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the fetal liver, spleen, placenta, lung, T cell, thyroid, testes expression of this gene at 
significantly higher or lower levels may be routinely detected in certzdn tissues and cell 
types (e.g., liver, spleen, placenta, lung, blood cells, thyroid, and testes and other 
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
30 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 



35 
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This gene is expressed primarily in brain and to a lesser extent in fetal heart, 
testis, spleen, lung. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: heart, liver and spleen 
diseases, immunological diseases. Similarly, polypeptides and antibodies directed to 
5 these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the brain, fetal heart, testis, spleen, lung expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, heart, testes 
10 and other reproductive tissue, spleen, and lung, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 155 

Activation of T cells through the T cell antigen receptor (TCR) results in the 
rapid tyrosine phosphorylation of a number of cellular proteins, one of the earliest being 
a 1(X) kDa protein. This gene is the human equivalent of murine valosin containing 
20 protein (VCP). VCP is a member of a family of ATP binding, homo-oligomeric 

proteins, and the mammalian homolog of Saccharomyces cerevisiae cdc48p, a protein 
essential to the completion of mitosis in yeast. Both endogenous and expressed murine 
VCP are tyrosine phosphorylated in response to T cell activation. Thus we have 
identified a novel component of the TCR mediated tyrosine kinase activation pathway 
25 that may provide a link between TCR activation and cell cycle control. 

This gene is expressed primarily in brain, liver, spleen, placenta. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
30 not limited to, cancer immunological disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, liver, spleen, placenta expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
35 tissues and cell types (e.g., brain and other tissue of the nervous system, liver, spleen, 
and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 157 

This gene is expressed primarily in Glioblastoma, endometrial tumor, 
lymphoma and pancreas tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Glioblastoma, 
Endometrial tumor, lymphoma and pancreas tumor. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

10 of the above tissues or cells, particularly of the immune, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., endometrium, lymphoid tissue, pancreas, and tissue of the nervous system, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 

15 having such a disorder, relative to the standard gene expression level, i.e., the 

expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 158 

20 The translation product of this gene shares sequence homology with IGE 

receptor which is thought to be important in allergy and asthma. 
This gene is expressed primarily in T cell, and fetal liver. 
Therefore, polynucleotides and polypeptides of the invention are usefid as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: allergy and asthma and 
other immunological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune, expression of this gene at significantly higher or lower 

30 levels may be routinely detected in certain tissues and cell types (e.g., blood cells, and 
liver, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spineil fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 

35 disorder. 



wo 98/39448 



PCT/US98/04493 



105 

This gene is expressed primarily in human pituitary gland and to a lesser extent 
in colorectal cancer tissue. This gene has also been observed in the LNCAP cell line. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions: hyperlipidemias of 
familial and/or idiopathic origins. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly blood, expression of this gene at significantly higher or 

10 lower levels may be routinely detected in certain tissues and cell types (e.g., pituitary 
and colon, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

15 disorder. 

The tissue distribution and homology to rat cytosolic acyl coenzyme-A 
hydrolase indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for the detection or treatment of hyperlipidemia disease states by virtue of the 
ability of specific drags to activate the enzyme. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 170 

The translation product of this gene shares sequence homology with a 
Caenorhabditis elegans gene which is thought to be important in organism 
development. 

25 This gene is expressed primarily in human synovial sarcoma tissue, bone 

marrow, and to a lesser extent in human brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

30 not limited to, of bone, specifically synovial sarcoma. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the bone, connective tissues and possibly 
immune system, expression of this gene at significantly higher or lower levels may be 

35 routinely detected in certain tissues and cell types (e.g., synovial tissue, bone marrow, 
brain and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., seram, plasma, urine, synovial fluid or spinal fluid) or another 
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tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to Caenorhabditis elegans indicate that 
5 polynucleotides and polypeptides corresponding to this gene are useful as a diagnostic 
and/or therapeutic modality directed at the detection and/or treatment of connective 
tissue sarcomas or other related bone diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 171 

10 The translation product of this gene shares sequence homology with betal- 

6GlcNAc transferase which is thought to be important in the transfer and metabolism of 
betal-6, N-acetylglucosamine. This gene product has previously been shown to 
suppress melanoma lung metastasis in both syngeneic and nude mice, decreased 
invasiveness into the matrigel, and inhibition of cell attachment to collagen and laminin 

15 without affecting cell growth. 

This gene is expressed primarily in human testes and prostate tissues, and to a 
lesser extent in kidney, medulla, and pancreas. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly rrielanoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing inmiunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 

25 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., testes and other reproductive tissue, prostate, kidney, pancreas, brain and 
other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

30 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to betal-6GlcNAc transferase indicate that 
the protein product of this clone is useful for the development of diagnostic and/or 
therapeutic modalities directed at the detection and/or treatment of cancer, the metastasis 

35 of malignant tissue or cells. Defects in this potentially secreted enzyme may play a role 
in metastasis. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 172 

This gene is expressed primarily in fetal spleen and liver. 

Therefore, polynucleotides and polypeptides of the invention are usefvil as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune disorders, 
Wilm*s tumor disease, hepatic disorders, and hematopoietic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
10 number of disorders of the above tissues or cells, particularly of the hematopoiesis and 
immune systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., spken and liver, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
15 disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid firom an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and identification of fetal defects 
along with correcting diseases that affect hematopoiesis and the immune system. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 173 

The translation product of this gene shares sequence homology with ret II 
oncogene which is thought to be important in Hirschsprung disease and many types of 
cancers. 

25 This gene is expressed in multiple tissues including the lymphatic system, brain, 

and thyroid. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of diseases and conditions: Hirschsprung disease and multiple 

30 cancers. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
central nervous system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., lymphoid tissue, 

35 thyroid, and brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
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the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to ret H oncogene indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
5 and treatment of various cancers. It would also be useful for the diagnosis and treatment 
of Hirschsprung disease. Preferred polypeptides of the invention comprise the amino 
acid sequence: MEAQQVNEAESAREQLQXLHDQIAGQKASKQELETELERLKQEFHYIEEDLY 
RTKNrn.QSRIKDRDEEIQKLRNQLTNKTLSNSSQSELENRLHQLTETLIQ 
VFQLERLEQQMNSASGSSSNGSSINMSGIDNGEGTRUWVPVLFNDTET^^ 
1 0 SIDQFSIRLGIFLRRYPIARVFVIIYMALLHLWVMIVLLTYTPEM HHOQPYGK (SEQ ID NO: 
642). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 174 

The translation product of this gene shares sequence homology with testis 
15 enhanced gene transcript which is thought to be important in regulation of human 
development. 

This gene is expressed primarily in infant brain and to a lesser extent in a variety 
of other tissues and cell types, including the prostate, testes, monocytes, macrophages, 
dendritic cells, keratinocytes, and adipocytes. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological, 
developmental, inunune and inflammation disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununological probes 

25 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain and immune systems, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, prostate, 
testes and other reproductive tissue, blood cells, keratinocytes, and adipocytes, and 

30 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to testis enhanced gene transcript indicate 

35 that the protein product of this clone is useful for diagnosis and treatment of disorders 
involving the developing brain and the inunune system. 



>DOCtD:<WO l»ftr»M4AA9 I ^ 



wo 98/39448 



109 



PCTAJS98/04493 



FEATURES OF PROTEIN ENCODED BY GENE NO: 175 

This gene is expressed primarily in prostate and to a lesser extent in various 
other tissues, including placenta. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, especially of the prostate. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the prostate, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., prostate and placenta, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 

15 sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of prostate disorders and cancer. It may also be useful for 

20 the diagnosis and treatment of endocrine disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 176 

The translation product of this gene shares sequence homology with 
Sacchromyces cerevisiae YNT20 gene which is thought to be important in 

25 mitochondrial function. 

This gene is expressed at a particularly high level in muscle tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases related to such tissues and cell types 

30 including: muscle wasting diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the neuromuscular system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 

35 types (e.g., muscle and cancerous and wounded tissues) or bodily fluids (e.g., semm, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
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type(s). For a number of disorders of the above tissues or cells, particularly of the CNS 
disorders and retinopathy, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and retinal tissue, and cancerous and wounded tissues) or bodily 
5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to caldesmon indicate that polynucleotides 
10 and polypeptides corresponding to this gene are useful for treatment of retinopathies. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 179 

The translation product of this gene shares sequence homology with mouse 
fibrosin protein which is thought to be important in regulation of fibrinogenesis in 

15 certain chronic inflanunatory diseases. 

This gene is expressed primarily in amniotic cells and breast tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of breast cancer and abnormal embryo 

20 development. Similarly, polypeptides and antibodies directed to these p)olypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., anmiotic cells, and 

25 mamm£iry tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The tissue distribution and homology to fibrosin indicate that the protein product 

of this clone is useful for treatment of breast cancer. This gene or its protein product 
could be used in replacement therapy for breast cancer. In addition the protein product 
of this gene is useful in the treatment of chronic inflammatory diseases. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 180 

This gene is expressed several infant tissues including brain and liver and 
various adult tissues including brain, lung, liver, testes, and prostate. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, brain cancer, lung cancer, liver cancer and cancers of the reproductive 
5 system. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, hepatic system, and reproductive system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 

10 and cell types (e.g., brain and other tissue of the nervous system, lung, liver, testes and 
other reproductive tissue, and prostate, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

15 individual not having the disorder. 

The tissue distribution of this gene product indicates that the protein product of 
this clone is involved in growth regulation and could be used as a growth factor or 
growth blocker in a variety of settings including treatment of cancers. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 181 

This gene is expressed primarily in activated monocytes and to a lesser extent in 
melanocytes and dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of immune system diseases and cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the inunune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 

30 in certain tissues and cell types (e.g., blood cells, melanocytes, and dendritic cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

35 The tissue distribution indicates that the protein product of this clone could be 

involved in growth regulation and could be used as a growth factor or growth blocker 
in a variety of settings including treatment of cancers. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 182 

This gene is expressed primarily in placenta and several tumors of various tissue 
origin and to a lesser extent in normal tissues including liver, lung, brain, and skin, 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of cancers of all kinds. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

10 of the above tissues or cells, particularly of the central nervous system, respiratory 

system and skin, expression of this gene at significandy higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., liver, lung, brain and other 
tissues of the nervous system, and skin, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

1 5 cell sample taken from an individual having such a disorder, reladve to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The high expression of this gene in multiple tumors indicates that the protein 
product of the clone may be involved in cell growth control and therefore would be 

20 useful for treatment of certain cancers. Likewise molecules developed to block the 

activity of the protein product of this clone could be used to block its potential role in 
tumor growth promotion. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 183 

25 The translation product of this gene shares sequence homology with the mouse 

Ndrl gene which is thought to be important in cancer progression. 

This gene is expressed multiple cell types and tissues including brain, lung, 
kidney, bone marrow, liver, and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

30 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biologiced sample and for diagnosis of all types of cancers. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the nervous, immune, and endocrine 

35 systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous 
system, lung, kidney, bone marrow, liver and spleen, and cancerous and wounded 
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tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder, 
5 The tissue distribution and homology to Ndrl gene, which is thought to be 

involved in cancer progression, indicate that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of certain cancers. Likewise 
molecules developed to block the activity of the protein product of this clone could be 
used to block its potential role in tumor growth promotion. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 184 

This gene is expressed primarily in early stage human brain and liver and to a 
lesser extent in several other fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: brain and liver cancers. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

20 central nervous system and inmiune system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, liver, and fetal tissue, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 

25 relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The expression of this gene in embryonic tissues indicates that the protein could 
be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 185 

This gene is expressed primarily in infant and embryonic brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of degenerative nervous system disorders and brain 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., embryonic tissue, brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The expression of this gene in embryonic tissues indicates that the protein could 
10 be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 186 

This gene is expressed primarily in multiple tissues including placenta, fetal 

15 lung, fetal liver, and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of all types of cancers including liver, brain and 
lung. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, pulmonary system, and hepatic system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., placenta, lung, liver, and brain and other tissue of the nervous system, 

25 and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The expression of this gene in embryonic tissues indicates that the protein could 

be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 



ISDOCID: <WO__983944aA^I_> 



wo 98/39448 



116 



PCT/US98/04493 







^ 1 




*n < 
"n 


30 


50 
CN 


Predicted 
First AA 

01 

Secreted 
Portion 




— < ( 


-n 




CN 


m 

CN 




OO 


30 


m 




CN 
































cn 
t— « 
m 




2) 

cn 


S 


m 


s 


5' NT 
of 

rirsi 

AAof 

Signal 

Pep 








ON 




»o 
cn 


D IN 1 

of 

Start 
Codon 




s 


oo 






*n 
cn 


3' NT 

of 
Clone 
Seq. 


oo 
<o 


o 
cn 
oo 


»o 
so 
Ti- 


cn 
cn 




ON 

cn 


5' NT 

of 
Clone 
Seq. 




VO 

o\ 




ON 

CN 
CS 






Total 
NT 
Seq. 


OO 


1020 


SO 


CN 

»o 




CN 

cn 
cn 






ON 




OO 

o\ 


m 


ON 
ON 


Vector 


X 

< 

M 
1 

a 


X 

< 
1 

B 


Uni-ZAPXR 


Uni-ZAPXR 


ZAP Express 


ZAP Express 


ATCC 
Deposit 
No:Z 
and Date 


97897 
02/26/97 
209043 

AC/1 c/na 


97897 
02/26/97 
209043 


97897 
02/26/97 
209043 

Uj/ ID/y / 


97897 
02/26/97 
209043 


97897 
02/26/97 
209043 

UJ/ 1 J/7 / 


97897 
02/26/97 
209043 


cDNA 
Clone ID 


1 


i 


HBGBW52 


HBGBW52 


u 


HCUFM41 


Gene 
No. 






CM 


(N 


cn 


cn 



JOOCID: <WO__983944aA^L> 



pCr/US98/04493 




wo 98/39448 



118 



PCT/US98/04493 





-* < 




s 


> < 






Predicted 
First AA 
or 

Secreted 
Portion 






ro 




■o 






ao 




O 
CO 


< 


-o 
































o 


cs 
ro 


CO 
<N 
CO 


ro 
O 


ro 


s 


5' NT 
of 

rirsi 

AAof 

Signal 

Pep 


•— 1 


o 
as 


VO 


ON 


s 


ON 

o 


NT 
^ 1> 1 

of 

Start 
Codon 




o 

ON 


cs 

NO 




s 


ON 

o 


3' NT 

of 
Clone 
Seq. 


c — 




ON 

On 


B 


CO 

VO 


CO 

Er> 


5' NT 

of 
Clone 
Seq. 


On 






CO 
CN 






Total 
NT 
Seq. 


VO 

CO 




On 

ON 


CS 

ON 


CO 
VO 


o\ 
oo 


izi z 


i 


o 




o 

CM 


cs 

<N 


<N 

o 

<N 


Vector 


-ZAPXR 


-ZAPXR 


-ZAPXR 


-ZAPXR 


)Sportl 


)Sportl 




Urn 


Uni 


Uni 


c 

:=> 






ATCC 
Deposit 
No:Z 
and Date 


978^7 
02/26/97 
209043 

r\c/i c/m 


97897 
02/26/97 
209043 

AC /I c'mn 
05/15/97 


97897 
02/26/97 
209043 
UD/ij/y/ 


97897 
02/26/97 
209043 

AC/1 C/Q7 


97897 
02/26/97 
209043 

nc/1 C/Ol 


97897 
02/26/97 
209043 


cDNA 
Clone E) 


HEIAX07 

i 


VO 

X 

< 

CO 


HNGJJ68 


oo 
VO 
1—, 
»— > 

o 
z 


HCFAW04 


HCFAW04 


Gene 
No. 


OS 


o 











SDOCID: <WO ^983944aA2J_> 



wo 98/39448 



119 



PCT/US98/04493 



35^1 


»o 




o 


o 
cs 


oo 

VO 


VO 
CO 


oo 


Predicted 
First AA 
or 

Secreted 
Portion 




m 




CO 


CM 


VO 
CM 


CO, 






o 
CO 




CO 

CO 


CM 


WO 
CN 


o 

CO 




















m 


vn 


VO 
O 


O 


VO 
CM 
CO 


OO 

o 


CM 
CO 


5" NT 
of 

AAof 
Signal 
Pep 


O 


r- 
oo 


o 

OS 
VO 


8 


1242 


CO 

o 

CO 


s 

CO 


5' NT 

»J ill 

of 

Start 
Codon 


CN 

o 


oo 




O 

o 


1242 


CO 

o 

CO 


s 

CO 


3' NT 

of 
Clone 
Seq. 


1418 


On 
CO 
OO 


O 
oo 


1354 


2059 


1226 


CO 

oo 
VO 


5' NT 

of 
Clone 
Seq. 


ON 




*o 




1017 


CO 




Total 
NT 
Seq. 


I486 


oo 


<N 
oo 


1354 


2323 


1378 


CO 

oo 
VO 




CO 


CO 

o 

CM 


s 

CM 


to 
o 

CM 




VO 


to 

CM 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 

i 


Uni-ZAPXR 

1 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97897 
02/26/97 
209043 


02/26/97 
209043 
u J/ ijiyi 


97897 
02/26/97 
209043 
fl5/1 VQ7 


209235 
09/04/97 


97897 
02/26/97 
209043 
0*5/1 VQ7 

yjjl IJIy 1 


97897 
02/26/97 
209043 

Uj/ ij/y/ 


97897 
0*2/26/97 
209043 

Uj/ 1 j/7 / 


cDNA 
Clone ID 


HLMAV65 


HLMAV65 


HLMAV65 


HTXEF04 


HPMFD84 


HPMFD84 


HE6DB26 


Gene 
No. 


CO 

1—1 


CO 


CO 


CO 






VO 



tSOOaO: <WO_9839448A2_L> 



wo 98/39448 



120 



PCT/US98/04493 





On 


VO 


cn 
VO 


CO 


CO 


CO 
CN 


Predicted 
First AA 
or 

Secreted 
Portion 


ON 




CO 


CN 
CO 


v-> 
CN 


o 

CN 




oo 


o 
<^ 


O 

CO 


CO 




On 
















^ CO Z 


ON 
O 

lO 


oo 


ON 
CN 

CO 


o 


O 
CO 
CO 


CO 
CO 


5' NT 
of 

Pi ret 

rirsi 
AAof 
Signal 
Pep 


vo 
m 




o 
r- 


CO 
CO 


o\ 

CO 


vo 


S' NT 

of 

Start 
Codon 


r- 




o 


CO 
CO 


ON 
CO 


vo 


3' NT 

of 
Clone 
Seq. 


oo 
oo 


1959 


r- 


On 
vo 


*o 

ON 


vo 

*o 
»n 


5' NT 

of 
Clone 
Seq. 


oo 






CN 






Total 
NT 
Seq. 


1166 


vo 

CO 

o 




ON 

vo 


ON 


vo 

»o 




o 
cs 


VO . 


r- 

CN 


oo 
o 

CN 


oo 
cs 


On 
CN 




X 


X 


X 


X 


X 


X 


Vectoi 


[-ZAP 


i-ZAP 


t-ZAP 


i-ZAP 


i-ZAP 


i.ZAP 




c 

p 


c 


B 


c 
D 


c 


c 
D 


ATCC 
Deposit 
No:Z 
and Date 


97897 
02/26/97 
209043 

uj/i j/y / 


,97897 
02/26/97 
209043 

UD/ ij/y/ 


97897 
02/26/97 
209043 
yjji uiy 1 


97897 
02/26/97 
209043 


97897 
02/26/97 
209043 


97897 
02/26/97 
209043 

vJ/ 1 J/7 / 


cDNA 
Clone ID' 


HE6DB26 


HHFFL33 


HODBD33 


HODBD33 


1 


HOUAWOl 


Gene 
No. 




vo 






oo 


—4 



wo 98/39448 



121 



PCT/US98/04493 





o 




oo 
cs 


2 


O 


VO 
CM 


Predicted 
First AA 

or 

Secreted 
Portion 


VO 

m 


m 


oo 
<s 


CM 
CM 


ro 


SO 
CM 


^ < w) a. 


cn 


O 

CO 


CN 


w« 
CM 


O 
CO 


to 

CM 


^00 a.. 
















Ol 

m 
cn 


CO 

CO 
CO 




Tj- 
CO 

CO 


ro 
ro 


CM 

In 


5' NT 
of 

AAof 
Signal 
Pep 


oo 


oo 


oo 

CO 


CO 


CO 


oo 


5' NT 

^ 111 

of 

Start 
Codon 


GO 


r- 
oo 


oo 

CO 


CO 


CO 


GO 


3' NT 

of 
Clone 
Seq. 






CM 
CO 
Cs 


VO 

oo 




s 

VO 


5' NT 

of 
Clone 
Seq. 






r- 

CM 








Total 
NT 
Seq. 


cn 




CM 
CO 

o\ 


VO 

oo 


CM 


VO 
VO 




o 
cn 


»— « 

CO 


On 
O 
Ol 


CM 
CO 


CO 
CO 


o 

CM 


Vector 


Uni-ZAPXR 


pSportl 


pSportl 


pCMVSport 
2.0 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 

and Date 


97897 
02/26/97 
209043 

UJ/IJ/7/ 


97897 
02/26/97 
209043 

KJj/lJty/ 


97897 
02/26/97 
209043 
OS/ 1^5/97 


97897 
02/26/97 
209043 

yjjiijiy 1 


97897 
02/26/97 
209043 
OS/ 1 5/97 


97897 
02/26/97 
209043 


cDNA 
Clone ID 


HBJAE44 


HCFMB4I 


HCFME41 


H0GC071 


OO 

o 

CO 

O 


HOSEX08 


Gene 
No. 


o 




CM 


CM 
CM 


CO 
CM 


CO 

CM 



ISDCX^tD: <WO_9839448A2J_> 



wo 98/39448 



122 



PCT/US98/04493 





OO 




CO 

CO 


vo 




CO 


Predicted 
First AA 

01 

Secreted 
Portion 




ro 




CM 


ro 






o 

CO 


o 

ro 


CS 


o 


o 

CO 


o 


















ro 

CO 


CO 
CO 


CO 


oo 
ro 

CO 


xf 


On 
CO 
ro 


5' NT 
of 

AAof 
Signal 
Pep 






cs 


CO 


* 


O 


5' NT 
of 

Start 
Codon 




so 
0\ 


r- 








3' NT 

of 
Clone 
Seq. 


CO 


CO 
ON 


xf 
CO 


NO 


On 
O 

<o 


ON 
Xf 
CO 


5- NT 

of 
Clone 
Seq. 














Total 
NT 
Seq. 


CO 


CO 
ON 


ON 


s 

vo 


oo 

CO 
On 


ON 

XT 
CO 




CO 


*o 

CO 




vo 
CO 


<N 


t-- 
CO 


Vector 


pBluescript 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97897 
02/26/97 
209043 

AC/1 C/QT 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 

U j/ i J/ 7 / 


97898 
02/26/97 
209044 

AC/1 </Q7 


97898 
02/26/97 
209044 


OO ON 3- ^ 

^ ^ s ^ 
o o 


cDNA 
Clone ID 


HSKNJ72 


On 
VO 

PQ 
U3 
CQ 

ac 


ON 

vo 
CQ 
U4 
OQ 


oo 

£ 

vo 

w 


oo 

w 


HSAUZ47 


Gene 
No. 






wo 


vo 


VO 
CM 


r- 
cs 



SOOCID: <WO_9839448A^_> 



wo 98/39448 



123 



PCT/US98/04493 







VO 
CN 


VO 






o 

(N 

»n 


Predicted 
First AA 
or 

Secreted 
Portion 


o\ ■ 


CN 


\r\ 

CN 


CO 


CN 


CN 




oo 

CO 


O 
CN 


CN 


o 

CO 


CO 














*— ( 






cn 


m 


CO 


CO 
CO 


|0 

In 


CO 


5' NT 
of 

AAof 
Signal 
Pep 




o 

CO 




r- 

CN 


ON 

CO 


CN 


5' NT 

^ 111 

of 

Start 
Codon 


CM 


ON 

o 

CO 


2; 






CN 


3' NT 

of 
Clone 
Seq. 


VO 


1908 


oo 


1153 


VO 

ON 

oo 


1983 


5' NT 

of 
Clone 
Seq. 




*n 

CO 


CO 
ON 


8 


CN 

o 
«n 


1092 


Total 
NT 
Seq, 


cs 


1908 


oo 


1153 


1079 


1983 




oo 
m 


ON 
CO 


O 




CO 

o! 


CN 


Vector 


Uni-ZAPXR 


X 

< 

"If 

c 


pSportl 


pSportl 


pSportl 


X 

PL, 
< 

• 


ATCC 
Deposit 
No:Z 
and Date 


97898 
02/26/97 
209044 

05/15/y/ 


97898 
02/26/97 
209044 

Uj/ 1 J/7 / 


97898 
02/26/97 

209044 
OS/1 5/97 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 
fiS/1 S/Q7 


cDNA 
Clone ID 


HSSDM73 


HBMVK68 


HMKDC66 


HMKCU94 


HMKCU94 


a: 


Gene 
No. 


oo 


C3N 


o 

CO 


CO 


CO 


CN 
CO 



4SDCXaD: <WO 983d44aA2J_> 



wo 98/39448 PCT/OS98/04493 

124 







3N 


^ i 






■o 


Predicted 
First AA 

01 

Secreted 
Portion 


< 








cs 


"O 

-o 




< 


— « 


m 




— • 


-o 


















>o 


m 


VO 

CO 




ro 


x> 

lO 


5' NT 
of 
rirsi 
AAof 
Signal 
Pep 


2030 


3\ 




ro 


' * 


OO 


*5' NT 
Jill 

: of 

Start 
Codon 












OO 


3' NT 
■ of , 
Clone 
Seq. 


3357 


OS 
VO 


1153 


1036 


1569 


1404 


5' NT 

of 
Clone 
Seq. 


2757 




oo 


cs 

<N 
OO 


OO 
VO 


o 


Total 
NT 
Seq. 


3791 


1406 


1391 


1334 


1569 


1511! 










o3 


lO 


CM 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


"97898 
02/26/97 
209044 
05/15/97 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 

AC/1 c/nT 


97898 
02/26/97 
209044 

05/15/y/ 


97898 
02/26/97 
209044 

05/15/97 


97898 
02/26/97 
209044 

yjjfij/yf 


cDNA 
Clone ID 


HRDEW41 


VO 

o 

z 
o 


HBGDA21 


< 
Q 
O 
PQ 
33 


HFGAK75 


HFGAK75 


Gene 
No. 


m 


m 

CO 


CO 


CO 


CO 


VO 
CO 



'.OOCID: <WO_9839448A2_L> 



wo 98/39448 PCTAJS98/04493 

125 



1^ o| 




oo 


CM 


CM 


CO 




Predicted 
First AA 
or 

Secreted 
Portion 


o 

CM 


oo 

CO 


O 


CO 




CO 




On 


CO 


On 


CO 




CO 










p— 1 






?gag>- 


OO 
CO 


On 
CO 


O 
»o 

CO 


*— 1 

wo 

CO 


ON 

wo 


CM 

»n 

CO 


5' NT 
of 

Pircf 

rub I 

AAof 
Signal 
Pep 


CM 

VO 




VO 




oo 


VO 


S' NT 
J ill 

of 

Start 
Codon 


CM 

VO 




VO 








3' NT 

of 
Clone 
Seq. 


1681 


NO 

On 
CO 


VO 
CO 


1300 


wo 


1404 


5" NT 

of 
Clone 
Seq. 




CM 
CM 




CM 

CO 

oo 


CN 
ON 
t 


o 


Total 
NT 
Seq. 


1924 


*o 


VO 

CO 


1366 


CM 

S 


1405 


CO Z 


VO 
rt 




oo 




CM 


o 
wo 


Vector 


a: 
X 

< 

N 

1 

"S 

p 


pSportl 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 
vji ij/y / 


97898 
02/26/97 
209044 


97898 
02/26/97 
209044 

LJiy 1 


97898 
02/26/97 
209044 

05/1 5/97 

u J/ ijiyt 


97898 
02/26/97 
209044 
OS/lS/97 


cDNA 
Clone ID 


HHPBD40 


HOVCL83 


HBCAY62 


HBICM48 


HB1CM48 


HLTCL35 


Gene 
No. 


CO 


CO 


OO 
CO 


ON 
CO 


ON 

CO 


O 



ISDOCID: <WO_983944aA2_l_> 



wo 98/39448 



126 



PCTAJS98/04493 





o 

m 




CM 
ir» 


CN 

ro 




s 

CN 


Predicted 
First AA 

or 
Secreted 
Portion 


CM 

cs 




lO 
CN 




CN 


CO 




?J 






o\ 
lO 


<o 

CN 


o 

CO 


















o 
cs 
»r» 


m 
m 


m 


IT) 


lO 


so 
CO 


5' NT 
of 

rirsi 

AAof 

Signal 

Pep 


cs 
r~ 


CN 
CN 
CN 


m 




ON 
ON 

ro 


VO 
VO 


S' NT 
of 

Start 
Codon 




CN 
CN 
CN 


CO 




On 
On 
m 


VO 
VO 


3' NT 

of 
Clone 
Seq. 


1241 


oo 




C3n 


VO 
oo 
VO 


1749 


5' NT 

of 
Clone 
Seq. 










VO 
oo 


CN 
CN 
CN 


Total 
NT 
Seq. 


1241 


o 


LLL 


CN 
O 
VO 


0801 


1749 




OO 
CN 


»o 


<N 


m 


C3N 
CN 


to 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


ZAP Express 


Uni-ZAP XR 


Uni^ZAPXR 


X 

1 

'3 

D. 


ATCC 
Deposit 
No:Z 
and Date 


97898 
02/26/97 
209044 


r-* ^ 
oo On 5 ^ 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 


cDNA 
Clone ID 


HLTCL35 


HLHCK50 


HRSAN45 


HSNBB14 


z 

CO 

SC 


HMABL38 


Gene 
No. 


O 




CN 




CO 


5 



aXXDtO: <WO_983944aA2. 1 > 



wo 98/39448 



127 



PCTAJS98/04493 





o r 


^ r 


■o c 




3 < 


7N 


Predicted 
First AA 

of 

01 

Secreted 
Portion 


1 C 




7n V 

"H < 


o 

s < 








^ c 




— < < 


* 1 V 

N < 


"O 




e < «s -2? S* 




















X) < 


^ < 
O < 


3n 

o 




5- NT 
of 

AAof 
Signal 
Pep 


cs 






O ( 






NT 
J 1^ 1 

of 

Start 
Codon 


\n 


o 






30 




3' NT 

of 
Clone 
Seq. 


O 

ON 




m 


m 

ON 


o 


CO 


5' NT 

of 
Clone 
Seq. 


ON 


o\ 


*o 

to 


to 
*o 


o 

ON 
NO 




Total 
NT 
Seq. 


OO 


VO 

ON 

oo 


cn 


CO 
On 


o 


NO 
ON 


^ CO Z 


o 


to 








CM 
CM 
<M 


Vector 


Uni-ZAPXR 


Uni-ZAP XR 


Uni-ZAPXR 


Uni-ZAP XR 


pCMVSport 
2.0 


pCMVSport 
2.0 


ATCC 
Deposit 

and Date 


97899 
02/26/97 
209045 

05/lJ/y/ 


97899 
02/26/97 
209045 


97899 

02/26/97 

209045 
^/Q7 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 

AC /I </Q7 


cDNA 
' Clone ID 


HMABL38 


HSKDK47 


HOSFH03 


HOSFH03 


HOGAV75 


HOGAV75 


Gene 
No. 


5 






NO 


5 





l<yXX:iDt<WO 983d4ddA2 I > 



wo 98/39448 



128 



PCT/US98/04493 





OO 




O 


CN 


CN 




Predicted 
First AA 

of 
Secreted 
Portion 


m 


CO 


On 


cs 

CN 


CO 


ON 




CO 
CO 


o 

CO 


oo 


CN 


o 

CO 


OO 








1—1 










s 

CO 


CO 


m 


CN 
NO 
CO 


CO 

NO 
CO 


VO 
CN 


5' NT 
Of 

rirsi 

AAof 

Signal 

Pep 


CO 
CO 


o\ 
oo 






o 

ON 


1953 


^' NIT 
J ill 

of 

Start 
Codon 


CO 
CO 


a\ 
oo 


1164 




o 

ON 


1953 


3' NT 

of 
Clone 
Seq. 


1049 


1737 


1791 


CO 


2888 


2517 


5' NT 

of 
Clone 
Seq. 


CM 

CO 


vo 
oo 


ON 
ON 




1909 


1597 


Total 
NT 
Seq. 


1049 


VO 


1791 


CO 


2888 


2517 




OO 


ON 


CO 

CNJ 

cs 






T^ 

CN 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


pBIuescript 
SK- 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97899 
02/26/97 
209045 


97899 
02/26/97 
209045 
05/15/97 


97899 
02/26/97 
209045 
05/15/97 


97899 
02/26/97 
209045 
05/15/97 


97899 
02/26/97 
209045 
05/15/97 


97899 
02/26/97 
209045 
05/1 S/97 

\j*jf Ljfy t 


cDNA 
Clone ID 


HFCAI74 


HAGBI17 


HAGBI17 


HLFBC91 


HPRCA31 


HPRCA31 


Gene 
No. 


OO 


On 


ON 


o 







wo 98/39448 



PCT/US98/04493 



35=1 


m 




r- 
VO 
tT 


CM 

\n 


On 
CO 


CO 
CO 


Predicted 
First AA 

of 
Secreted 
Portion 


CO 


oo 


VO 
04 


CO 


lO 
CN 


CO 




CO 




V 1 

CM 


CO 




CO 


















m 


<o 


»o 

VO 

cn 


VO 
VO 
CO 


OO 
CM 

»o 


VO 

CO 


5' NT 
of 

Pirct 

rirsi 

AAof 

Signal 

Pep 


On 

cn 


o 

m 


VO 
On 


ON 
CN 
<N 


VO 
CO 


VO 
CO 

CM 


5" NT 
of 

Start 
Codon 


o\ 

CO 




On 


o\ 


VO 

CO 


VO 
CO 
CM 


3' NT 

of 
Clone 
Seq. 


1736 


2309 

i 

1 
1 


3492 

i 
1 


CO 
OO 

oo 


1033 

1 

1 
1 


1 1541 


5' NT 

of 
Clone 
Seq. 


1568 

j 


On 

ON 

CN 


CO 
OO 

oo 


CO 


cs 




Total 
NT 
Seq. 


1851 




3542 


CO 
OO 

oo 


o 
oo 
o 


1541 


CO Z 


VO 


CM 






VO 
CN 
CM 


*o 

VO 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


ZAP Express 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97899 
02/26/97 
209045 

Uj/lj/y/ 




97899 
02/26/97 
209045 
ns/1 5/Q7 


97899 
02/26/97 
209045 

U j/ 1 j/ 7 / 


97899 
02/26/97 
209045 

UJ/ 1 J/7 / 


97899 
02/26/97 
209045 


cDNA 
Clone ID 


HPRCE95 


HPRCE95 


HHTLC66 


HMADJ02 


HMADJ02 


HPRCU93 


Gene 
No. 




VO 


m 
wo 


VO 


VO 


VO 
VO 



SDOaO: <WO_983944aA?_lj> 



wo 98/39448 



130 



PCT/US98/04493 





E50 


m 


CN 


IN 




s 

CN 


ON 


Predicted 
First AA 
or 

Secreted 
Portion 


<o 

CN 


ON 


CN 






o 

CN 


CO 




CN 


OO 


O 
(N 






ON 


o 

cn 




















C5\ 
CN 


1 


O 

cn 


ON 

ro 


rn 
in 


o 

CO 


cn 


5' NT 
of 

Pi ret 

AAof 
Signal 
Pep 


I 


cn 

NO 


1262 


S 

CN 


AZZ 


w^ 

ON 


CN 
CN 


5' NT 

of 

Start 
Codon 


NO 


cn 

VO 


1262 


S 

CN 


Z.ZZ 


iO 

ON 


ZZ 


z^s § « 


\o 
rn 
cn 


oo 

C3\ 
VO 


1756 


On 
CN 
VO 


VO 

CO 

wn 


1751 


OO 

o 


5* NT 

of 
Clone 
Seq. 


Tt 




1133 




»o 

CN 


r- 

CO 




Total 
NT 
Seq. 


NO 

cn 
cn 


CN 

cn 


2043 


o\ 

CN 
VO 


IT) 


1751 


oo 
o 
wo 


CO "^Z 


CN 
CN 


VO 
VO 


oo 




On 


OO 

\o 


VO 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 

1 


Uni-ZAPXR 


Uni-ZAPXR 


Lambda ZAP 
II 


Uni-ZAPXR 


ZAP Express 


ATCC 
Deposit 
No:Z 
and Date 


97899 
02/26/97 
209045 

05/lj/y7 


97899 
02/26/97 
209045 

U j/ 1 D/i? / 


97899 
02/26/97 
209045 


209011 
04/28/97 


97899 
02/26/97 
209045 

Ujt 1 j/7 / 


97899 
02/26/97 
209045 

VJl iJiy 1 


97899 
02/26/97 
209045 

UJ/ / 


cDNA 
Clone ID 


■HPRCU93 


HSAXS65 


HSAXS65 


HKTAG35 


CN 

X 


HHFHN61 


o 

it 


Gene 
No. 


wo 


VO 
WO 


VO 


WO 




OO 
WO 





wo 98/39448 PCT/US98/04493 

131 









VO 


O 
VO 




Ov 
CO 


Predicted 
First AA 

of 
Secreted 
Portion 


CO 

CN 


cs 


CO 


oo 


CO 


Ov 
<N 




<N 




O 
cn 




o 

CO 


OO 


















CN 

m 
»n 


CM 
CO 


CO 

cn 


ro 

CO 


CO 


CO 


5' NT 
of 

AAof 
Signal 
Pep 




ON 




O 
cs 


a\ 

VO 


ov 

cs 

CO 


5' NT 
of 

Start 
Codon 




m 




o 
cs 


o\ 

VO 


o\ 
cs 

» 


3' NT 

of 
Clone 
Seq. 


oo 




CO 


o 


CO 

r- 


o 
oo 
»o 


5- NT 

of 
Clone 
Seq. 










oo 


o 

ON 


Total 
NT 
Seq. 


oo 




»— 1 

VO 


o 


CO 


o 

CO 

oo 




o 
cs 


o 




CO 




cs 

CO 

cs 


Vector 


ZAP Express 


LambdaZAP 
II 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAP XR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97899 
02/26/97 
209045 

yjjlijiyi 


97899 
02/26/97 
209045 

0^/1 ^/Q7 


97900 
02/26/97 
209046 
05/15/97 


97900 
02/26/97 
209046 
05/15/07 


97900 
02/26/97 
209046 
ns/1 S/97 

UJ/ IJU 1 


97900 
02/26/97 

209046 
05/1 5/Q7 


cDNA 
Clone ID 


HCWEF90 


HHGCM20 


HFRAUlO 


HFRAUlO 


HATDT67 


HATDT67 


Gene 
No. 




O 






CM 
VO 


cs 
VO 



SOOCIO: <WO_9839448A^I_> 



wo 98/39448 PCT/OS98/04493 

132 





5 


CS 


cn 
O 
CS 


VO 

cn 


On 

cs 


NO 

cn 


Predicted 
First AA 

01 

Secreted 
Portion 


cn 




cn 


cn 
cs 


c?\ 
cs 


cn 




O 
cn 




O 
cn 


cs 

cs 


oo 
cs 


o 
cn 


















wn 
r- 
cn 


cn 
>o 


so 
cn 


VO 

cn 


LL£ 


oo 
cn 


5* NT 
of 

AAof 
Signal 
Pep 


s 


r- 
oo 
cs 


o 
cn 
r- 


2577 


<N 


cn 


5' NT 
of 

Start 
Codon 


s 


oo 
cs 


o 

rn 


2577 


CS 


cn 


^ a ■ 


CN 
VO 

oo 


»o 

o 

ON 


4525 


2739 


1195 




5' NT 

of 
Clone 
Seq. 




oo 
cn 


4162 


2406 






Total 
NT 
Seq. 


cs 

NO 

oo 


cs 
m 
o\ 


4602 


2786 


1255 


in 




cn 


cn 
m 
cs 








NO 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-Zap XR 


Uni-ZapXR 




Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 
xjji iJiy 1 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 

U J/ 1 J/ 7 / 


cDNA 
Clone ID 


HOUBG93 


cn 
OS 
O 

m 
o 


HMWEX24 


HMWEX24 


oo 

< 

O. 

CO 


HTOCD52 


Gene 
No. 


cn 


cn 




VO 


in 

VO 


VO 

NO 



wo 98/39448 PCT/US98/04493 

133 





2; 


5 


oo 
VO 


oo 


CM 


a\ 

CM 


Predicted 
First AA 
or 

Secreted 
Portion 




CO 


CO 






oo 






CO 

CO 


o 
CO 




o 

CM 




















CO 

m 


o\. 

CO 


o 

oo 

CO 


oo 

CO 

wo 


OO 
CO 


CM 
OO 
CO 


5' NT 
of 

AAof 
Signal 
Pep 


NO 
CM 




VO 
CM 


VO 
CM 


t-- 

VO 
CM 


CM 
C3V 
CM 


5' NT 

of 

Start 
Codon 


VO 




VO 
CM 


CM 


VO 


CM 

o\ 

CM 


3* NT 

of 
Clone 
Seq. 


oo 
uo 


a\ 

CM 


1730 

1 




1168 

i 


1285 


5' NT 

of 
Clone 
Seq. 




uo 

CM 


1627 




VO 
CO 


CM 
CO 


Total 
NT 
Seq. 


oo 

VO 


»r> 
VO 


1907 


VO 


89II 


1285 


z w 9 2 X 


uo 

CO 
CM 




oo 


VO 
CO 
CM 


o\ 


o 

oo 


Vector: 


Uni-ZAPXR 


Uni-ZAPXR 


i pBluescript 


pBluescript 


Uni-ZAPXR 


OS" 
X 

< 

N 

t 

B 


ATCC 

Deposit 

and Date 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 
05/15/97 


97900 
02/26/97 
209046 
05/15/07 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 
05/1 5/97 


cDNA 
Clone ID 


HTOCD52 


HTGCP16 


HKIXR69 


HKIXR69 


HETGJ09 


H0BNC61 


Gene 
No. 


VO 
VO 


VO 


oo 
VO 


oo 

VO 


ON 

SO 


o 



SOCX^tD: <WO_983944aA2_l_> 



wo 98/39448 PCTAJS98/04493 

134 





OO 




cs 
uo 




ro 


<o 
oo 


Predicted 
First AA 
or 

Secreted 
Portion 


<N 
cs 




en 




vO 
CS 


CO 




CM 


o 


O 




w^ 
CS 


o 

CO 






























ro 
oo 

CO 


oo 
ro 


oo 

CO 


On 
CO 
WO 


VO 
OO 
CO 


oo 

CO 


5' NT 
of 

Pi ret 

AAof 
Signal 
Pep 


o 


ps 




1204 


wo 
oo 


VO 
VO 


NT 

of 

Start 
Codon 


o 


On 


o 
o 
cs 




w^ 
oo 


VO 
VO 


3' NT 

of 
Clone 
Seq. 


'^f 
o 


oo 

VO 


1953 


ON 
WO 
On 


CO 

wo 


o 
oo 


5' NT 

of 
Clone 
Seq. 


oo 




1609 


OS 

cn 




a\ 
wo 


Total 
NT 
Seq. 


1290 


oo 

VO 


2024 


SO 
OO 

cs 


CO 
C7\ 


wo 

cs 
oo 




OO 


cs 
oo 


CO 
OO 


CO 

cs 


oo 


wo 
oo 






XR 


XR 


XR 


X 


XR 


Vectoi 


r 


i-ZAP 


i-ZAP 


i-ZAP 


i-ZAP 


i-ZAP 






Uni 


Uni 


Uni 


Uni 


Uni 


ATCC 
Deposit 
No:Z 
and Date 


97900 
02/26/97 
209046 

Uj/1j/7/ 


97900 
02/26/97 
209046 

AC/1 C/Q7 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 

S/Q7 


cDNA 
Clone K) 


OS 

S 


HBIAI9^ 


HSQEL25 


wo 
cs 

a 

CO 


HEBEG68 


HBIAB39 


Gene 
No. 




CS 


CO 


CO 




WO 



SOOCiD: <WO_9839448A^.> 



wo 98/39448 PCT/US98/04493 

135 





oo 
o 


so 
O 




On 

ON 


rn 


CN 


Predicted 
First AA 

01 

Secreted 
Portion 


oo 

CO 


VO 




uo 






VO 

cn 


















CO 






cn 
*n 


cn 


*• 1 
cn 




















? 


oo 
oo 
cn 


Ov 

oo 
cn 




cn 


5' NT 
of 

Pirct 

AAof 
Signal 
Pep 








VO 
VO 


o 
wo 


o 

ON 

cn 


5' NT 
Of 

Start 
Codon 








VO 
VO 


o 


o 

ON 

cn 


^ S • 


CO 


ON 


oo 
o\ 


oo 


2080 


1520 


5' NT 

of 
Clone 
Seq. 




o 
oo 


VO 

cn. 


ON 


oo 


cn 


Total 
NT 
Seq. 




s 

oo 


1238 


o 

VO 


2201 


1661 




oo 
cn 

<N 


o\ 
cn 


vO 
oo 


oo 


o ' 






XR 


XR 


XR 


XR 


XR 




XR 


Vectoi 


-ZAP 


-ZAP 


-ZAP 


-ZAP 


-ZAP 




-ZAP 




Uni 


Uni 


Uni 


Uni 


Uni 




Uni 


ATCC 
Deposit 
No:Z 
and Date 


97900 
02/26/97 
209046 
n^/i ^/Q7 


97900 
02/26/97 
209046 

UJ/ l*J/7 / 


97900 
02/26/97 
209046 

UJ/ lJ/7 1 


97900 
02/26/97 
209046 

\JD/ ij/y 1 


97900 
02/26/97 


209046 
05/15/97 


97900 
02/26/97 
209046 
uj/ ijiyi 


cDNA 
Clone ID 


HBIAB39 


ON 

cn 
X 


HTXDU73 


HOEAS24 


CO 

o 

X 


HOEAS24 


Gene 
No. 






VO 

r- 




r- 


r- 



ISDOCIO: <WO_983944SA2_L> 



wo 98/39448 



136 



FCTAJS98/04493 





5\ 


o 


3\ < 
ro I 


3N > 


o 

■o 


3 
dO 


Predicted 
First AA 

nf 

or 

Secreted 
Portion 




z> 

r\ 




— • < 

"O < 


r> 
ro 


ro 




<o 
m 




CN i 


"O 


CN 
ro 


CO 






p— t 












o 

ON 

CO 


0\ 

ro 


t 


j\ 
ro 


ro 

ON 

ro 


ON 

ro 


5' NT 
of 

PI ret 

AAof 
Signal 
Pep 


OS 

CO 

VO 


O 


s 


S 


o\ 


OO 
ON 


^' NT 
Jill 

Of 

Start 
Codon 


o\ 

CO 

VO 


? 

uo 


s 


o 
— 


CO 


OO 
0\ 


8 


lO 
ON 
CO 


9811 


1146 


1614 


cs 

VO 
OO 


VO 

ON 

VO 


5* NT 

of 
Clone 
Seq. 


r- 
VO 


\r\ 

CO 


ON 
CO 


1203 


CO 


0\ 
CO 


Total 
NT 
Seq. 


1395 


1186 


1146 


1821 


VO 
OO 


VO 
On 
VO 




OO 
OO 


OO 




O 

ON 


ON 


ON 


Vector . 


X 
a. 
< 

'5 


pBluescript 


pBluescript 


a. 
< 

1 

c 


Uni-Zap XR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97900 
02/26/97 
209046 

05/15/97 


97900 
02/26/97 
209046 

AC/1 c/07 


97900 
02/26/97 
209046 

U J/ 1 j/7 / 


97900 
02/26/97 
209046 

uj/iD/y/ 


97900 
02/26/97 
209046 


97900 
02/26/97 
209046 


cDNA 
Clone ID 


HTEIY30 


VO 

s 

00 

a: 


HSKNE46 


HPMFL27 


HMWDN32 


HPRAX55 


Gene 
No. 


OO 


ON 


o\ 


o 

OO 


OO 


OO 



wo 98/39448 PCTAJS98/04493 

137 





OO 

\n 


CM 


g 




CO 
CO 


o 


C^ 
O 

ro 


Predicted 
First AA 

of 

Secreted 
Portion 


CO 
CO 




CN 


CO 

CO 


CM 


CO 


CM 
CM 




CO 




CN 


CO 




o 

CO 


CM 












































CO 


VO 

OS 
CO 


Ov 
CO 


VO 
rr 


oo 

OS 
CO . 




5' NT 
of 

AAof 
Signal 
Pep 


oo 

CO 


On 


oo 


VO 

o 

CM 


OS 


CO 
CM 


CM 


NT 
of 
Start 
Codon 


oo 
CO 


o\ 


uo 
oo 
r- 


VO 

o 
cs 


OS 


CO 
CM 


uo 

CM 


3' NT 

of 
Clone 
Seq. 


1230 


1759 


1772 


oo 

s 




2801 


CO 


5' NT 

of 
Clone 
Seq. 


*o 

VO 










OO 




Total 
NT 
Seq. 


1350 


1886 


1774 


2503 


1529 


2801 


1537 




CO 

cs 


CO 

OS 




uo 

OS 


CM 


VO 
OS 


to 




XR 


XR 


XR 




XR 




XR 




XR 


Vectoi 


i-ZAP 


i-ZAP 


i-ZAP 




i-ZAP 




i-ZAP 




i-ZAP 




Uni 


a 


c 
D 




c 
D 




B 




c 


ATCC 
Deposit 
No:Z 
and Date 


97900 
02/26/97 
209046 

nr/i c/07 


97900 
02/26/97 
209046 

u j/ 1 D/y 1 


97901 
02/26/97 


209047 
05/15/97 


209076 
05/22/97 


97901 
02/26/97 
209047 


97901 
02/26/97 


209047 
05/15/97 


97901 
02/26/97 
209047 

yjji ij/y / 


cDNA 
Clone ID 


HPRAX55 


VO 
CO 


HE2PL77 


HSDFV29 


CO 

I 

a 
u 


HTPEG42 


CM 

o 

Pu 
H 
K 


Gene 
No. 


OO 


CO 

oo 


oo 


oo 


»o 
oo 


VO 
OO 


VO 
CO 



iSOOaD: <WO_9839448A2_l_> 



wo 98/39448 PCTAJS98/04493 

138 



35^1 




t 


— — 


^ I 

■< 






Predicted 
First AA 
or 

Secreted 
Portion 






■n 














"O 


30 


?^ 




to <f« u_ bO 
















ON 
ON 


S 


m 






cs 


5' NT 
of 

rirsi 
AAof 
Signal 
Pep 




2J 


oo 

p— < 




m 
0 

ro 


cs 


of 

Start 
Codon 


» t 


o\ 


CO 
X) 




m 
ro 




3' NT 

of 
Clone 
Seq. 


1631 




o\ 

ON 


1416 


1348 1 


2847 


5' NT 

of 
Clone 
Seq. 


VO 


cs 




2; 


00 




Total 
NT 
Seq. 


1631 


s 


90S 


1416 


1348 


2847 




as 


oo 


so 


ON 
ON 


CS 


S 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


X 
c 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97901 
02/26/97 
209047 

AC/1 c/m 


■.97901 
02/26/97 
209047 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

AC/1 C/Q7 


97901 
02/26/97 
209047 

AC /I C/Q7 


i 

cDNA 
Clone ID 


HLHDR57 


■ HAUAV32 


"HAUAV32 


HNEB160 


8 
z 


HSHCJ16 


Gene 
No. 


oo 


OO 

oo 


oo 
oo 


o\ 


ON 
00 


0 

ON 



wo 98/39448 PCTAJS98/04493 

139 





00 


ON 


00 
VO 








Predicted 
First AA 
01. 

Secreted 
Portion 




CO 


CO 


o\ 




vo 
CO 


■3 < «*-. sp J?< 


CO 


0 

CO 


0 

CO 


00 




CO 


First 
AA 
of 
•Sig 
Pep 
















CO 

0 
^ 


s 


0 
»o 


lo 

vo 


vo 
0 


vo 

0 


5' NT 
of 

AAof 
Signal 
Pep 


0 
VO 


00 
uo 


vo 
CO 


2; 


vo 

o\ 


00 


5' NT 
of 

Start 
Codon 


0 
VO 


00 
m 


VO 
VO 
CO 




vo 
On 


00 


3' NT 

of 
Clone 
Seq. 


1346 


s; 


1766 


1708 

1 


1531 


00 


5' NT 

of 
Clone 
Seq.. 


00 
0 
VO 








00 

ON 
00 


vo 
0 


Total 
NT 
Seq. 


1394 


ON 


1766 


2664 


VO 


r- 
00 




0 


0 


00 

CM 


On 


CO 

"r-4 


s 

»— ♦ 


Vector 


pBIuescript 


X 
1 

'S 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 

Deposit 
No:Z 
and Dale 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

uj/ 1 / 


97901 
02/26/97 
209047 
05/15/97 


97901 
02/26/97 
209047 
OS/ 15/97 


97901 
02/26/97 
209047 
05/1 5/97 


97901 
02/26/97 

209047 
OS/15/07 


cDNA 
Clone ID 


HTSEL31 


HAUBL57 


HAUBL57 


HAUBL57 

1 


HODAS59 


HE6CT48 


Gene 
No. 


ON 


ON 


ON 


On 


CO 

ON 


3: 



iSDOaO: <WO_9839448A2J_?. 



wo 98/39448 



140 



PCT/US98/04493 



35^1 




s 




o 

OO 

cs 


to 


to 

CM 


OO 
CM 


Predicted 
First AA 
or 

Secreted 
Portion 


o 


CM 
CM 


m 

CM 


CM 

to 


VO 




CO 






»— 4 

CM 


CM 


to 


to 


O 


o 

CO 


to <* Cli 
E < ^ CO ou 


















to 
*o 




CO 

to 
to 


oo 


VO 

to 


»o 
to 
to 


ON 


5' NT 
of 

AAof 
Signal 
Pep 


oo 
to 

CN 


SO 


ON 
CM 
OO 


CM 

rj 


CO 

CO 
VO 


rs| 
oo 


to 

VO 


S' NT 
J ill 

of 

Start 
Codon 


OO 

to 

CN 


VO 


C7N 
CM 
OO 


CM 
<M 




CM 

oo 


to 


3* NT 

of 
Clone 
Seq. 


to 

OO 




2074 


1542 


1482 


CO 

oo 


2327 


5' NT 

of 
Clone 
Seq. 


ON 




CM 

to 

OO 


VO 

o 


oo 
o 
to 




1528 


Total 
NT 
Seq. 


to 
VO 
oo 




■2082 


1542 


1482 


CO 

oo 


2327 


CO Z 


o 
to 


to 
o 


to 

CM 


O 


cs 


CO 

to 




Vector 


Uni-ZAPXR 


X 
s 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAP XR 


ZAP Express 


Uni-ZAPXR 


ATCC 
Deposit 

and Date 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

Uj/ iD/y t 


97901 
02/26/97 
209047 

05/1 5/97 


97901 
02/26/97 

209047 
05/15/97 


97901 
02/26/97 

209047 
05/15/97 


209215 
08/21/97 


97901 
02/26/97 
209047 
05/1 5/07 


cDNA 
Clone ID 


HE6CT48 


1 


HMDAA61 


HAQBK61 


HAQBK61 


HCUHBOl 


HAQBF73 


Gene 
No. 


ON 


to 

On 


to 

ON 


VO 

ON 


\o 

On 


VO 
C3N 


On 



wo 98/39448 



141 



PCTAJS98/04493 





On 


OO 


CO 
<N 




CM 

oo 


CM 

ON 


Predicted 
First AA 
or 

Secreted 
Portion 




ON 




CO 


CO 








oo 


o 

CO 


CO 


CM 


vO 




1 














VO 


o 




*o 


OO 
iO 
lO 


CM 


5' NT 
of 

Ftrct 

AAof 
Signal 
Pep 


oo 
oo 

OS 




CO 

O 

ON 




lO 




NT 

of 

Start. 
Codon 




<M 


CO 

O 

ON 


VO 






3' NT 

of 
Clone 
Seq. 


1508 


<N 
VO 
O 


2501 


2431 


2288 

i 


1751 

1 


5' NT 

of 
Clone 
Seq. 


oo 
oo 






On 
VO 


VO 
VO 


On 
VO 
C7N 


Total 
NT 
Seq. 


1508 

1 


VO 

o 


2539 


to 


2357 


1751 




*o 


oo 
o 


§ 


VO 

»o 

CM 


VO 
VO 
CN 


o 


Vector 


X 

Dm 

• 

"S 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Lambda ZAP 
II 


ATCC 
Deposit 
No:Z 
and Date 


97901 
02/26/97 
209047 
ijij 1 


97901 
02/26/97 
209047 

\jjt LJty I 


97901 
02/26/97 
209047 
yjji uiy 1 


97901 
02/26/97 
209047 
n'?/! 5/97 


97901 
02/26/97 
209047 
05/15/97 


97901 

02/26/97 

209047 

05/1 5/97 
ujf i-jiy 1 


cDNA 
Clone E) 


HAQBF73 


HAQBT94 


HETHE07 


HETHE07 


HETHE07 


HLQAB52 


Gene 
No. 


ON 


oo 
o\ 


ON 
ON 


ON 
ON 


ON 
ON 


8 



ISOOaO: <WO_983944aA2_L> 



wo 98/39448 



142 



PCT/US98/04493 




wo 98/39448 PCT/US98/04493 

143 





CO 
VO 


m 

CN 


oo 






cn 


Predicted 
First AA 
or 

Secreted 
Portion 


( 

CO 


cn 




vo 
VO 




cn 




o 
tn 


O 

cn 




»o 
vo 




cn 
cn 








•—4 










vo 
^ 


cn 
vo 


3 




vo 


vo 
vo 


y NT 
of 

Fir^t 

AAof 
Signal 
Pep 


m 
cn 


v-> 

cn 
cn 


Ov 


s 




cn 


5' NT 
of 

Start 
Codon 


cn 
cn 


cn 
cn 




o 
o 






3" NT 

of 
Clone 
Seq. 


1171 


vo 


1131 


8 

OO 


*o 
cn 


cn 
oo 


5' NT 

of 
Clone 
Seq. 


O 


vo 
VO 


ON 
CN 
VO 


cn 
r- 
cn 


o 

On 
CM 


vo 


Total 
NT 
Seq. 


1171 


1179 


1162 


CM 

oo 


*o 
cn 
r*- 


cn 
oo 








vo 
CM 


vo 


cn 
vo 
CM 




Vector 


ZAP Express 


ZAP Express 


ZAP Express 


Uni-ZAPXR 


X 

C 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

KJJf iJ/y 1 


97901 

02/26/97 

209047 

05/1 V97 
u J/ u/y t 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

vjf ij/y / 


97901 
02/26/97 
209047 

yjji ijiy 1 


cDNA 
Clone ID 


HCUBC79 


HCUBC79 


HCUBC79 


> 

00 


HSVAF07 




Gene 
No. 


s 


s 

*— 1 


o 


uo 
o 

1— « 


VO 

o 


vo 

o 



ISOOCID: <WO_983944aA2J_> 



wo 98/39448 



144 



PCT/US98/04493 





o 


ro 




o 


m 


CO 


Predicted 
First AA 

01 

Secreted 
Portion 




CO 


CM 


o\ 
CM 


CO 


CS 


•g <t 60 0, 


o 

CO 


O 


CM 


OO 
CM 


o 
CO 


CO 
<M 


i£ < O CO OL, 
















oo 


o 
»n 


OO 

VO 


OV 


o 

CM 


CJV 

\o 


5' NT 
of 

AAof 
Signal 
Pep 


oo 
»n 


fill 


oo 

CO 




o 
CO 


r- 

CM 
CM 


5' NT 

J 111 

of 

Start 
Codon 


oo 
»o 


a\ 


oo 

CO 


as 
o\ 


o 

CO 


LZZ 




o 
2; 


O 

2; 


1188 


VO 

o 

OS 


§ 


1050 


5" NT 

of 
Clone 
Seq. 


oo 


o 
m 


oo 


oo 


CM 


CM 


Total 
NT 
Seq. 


1640 


oo 
VO 


1455 


CM 

o\ 


1256 


1086 




VO 


»o 

VO 

cs 


992 




OO 


NO 

tN 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 

n*;/ IS/07 


97901 
02/26/97 
209047 

u*?/ iJiy / 


97901 
02/26/97 
209047 

OS /I S/Q7 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 


cDNA 
Clone ID 


;o 

P 
PC 


VO 

1 

CO 

H 


HT3AM65 


HE6DK18 


HEBEK93 


HEBEK93 


Gene 
No. 


VO 
O 


VO 

o 


VO 

o 


o 


OO 

o 


OO 

o 



wo 98/39448 



145 



PCT/US98/04493 







S 


CO 


S 

CS 




CM 
CO 


Predicted 
First AA 
or 

Secreted 
Portion 


1—1 


CO 


oo 


fO 
CO 


CO 


OO 
CM 




o 


cn 




CN 
CO 


o 

ro 


CM 




















O 
*o 




CM 


CO 

c^ 


CN 

r- 
*n 


5- NT 
of 

AAof 
Signal 
Pep 






Ol 
CO 
CM 


OO 
CO 


o 


CO 
CO 


5' NT 
of 

Start 
Codon 


»o 




CS 

m 
<N 


OO 
CO 


o 


r- 

CO 

CO 


3' NT 

of 
Clone 
Seq. 


1051 


m 

s 


1015 


QZL\ 


S 


VO 
VO 


5' NT 

of 
Clone 
Seq. 










OO 


»— ( 


Total 
NT 
Seq. 


2; 
»— < 


1003 


1234 


1782 


o 






ON 


OO 
04 


ON 

cs 


o 

CM 


CM 


o 
r- 
cs 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97901 
02/26/97 
209047 
n^/i S/Q7 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 
05/15/97 


97901 
02/26/97 
209047 


97901 
02/26/97 
209047 
nS/1 VQ7 


97901 
02/26/97 
209047 

05/1 VQ7 
u^/ uiy 1 


cDNA 
Clone ID 


O 


HJPCMIO 


HJPCMIO 


HSXBL78 


H0EAW81 


H0EAW81 


Gene 
No. 


On 
O 


ON 

o 


On 
O 


o 







ISOOCID: <WO_983944aA2^!_> 



wo 98/39448 



146 



PCTAJS98/04493 



35o| 


cs 




x> 


2 




O 


Predicted 
First AA 

01 

Secreted 
Portion 










O 


fO 

cs 






m 








"S 

cs 






















m 
>o 


sO 


cs 


in 


5" NT 
of 

AAof 
Signal 
Pep 


s 


oo 


oo 

DO 






o 
oo 


*)' NT 
of 

Start 
Codon 


2 


oo 


oo 
oo 


VO 


WO 


o 
oo 
cs 


3' NT 

of 
Clone 
Seq. 


m 


o\ 


1626 


1640 


s 

oo 


CO 
so 


5' NT 

of 
Clone 
Seq. 


*o 
oo 


1179 


a\ 
oo 
oo 


S 






Total 
NT 
Seq. 


cs 


2081 


1731 


1717 


3 

oo 


1320 




CS 


cs 






<s 


cs 
cs 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


OH 
X 

a 


OS 
X 

*s 

D 


pSportl 


pSportl 


ATCC 
Deposit 
No:Z 
and Date 


97501 
02/26/97 
209047 

nc/1 C/Q7 


97901 
02/26/97 
209047 

\Jj/lj/y/ 


97901 
02/26/97 
209047 

UJ/ 1 J/7 / 


<N ^ 2 & 
S so o <r) 


97902 
02/26/97 
209048 


97902 
02/26/971 
209048 


cDNA 
Clone ID 


1 

O 
K 


HEAAR60 


HEAAR60 


in 

CO 

O 
X 
H 
K 


CO 


HOVBA03 


Gene 
No. 


cs 


CO 


CO 


3; 


uo 

»— 4 





wo 98/39448 



PCT/US98/04493 



1^ o| 




o 

CO 


o 

CO 


ON 
CM 


o 

CO 


CM 


Predicted 
First AA 

or 
Secreted 
Portion 


m 




CO 


OS 








oo 
m 




/— N 
CO 


oo 

*— ( 






















oo 
5! 




ON 
CM 


NO 


o 

CO 


CO 


5' NT 
of 

Pirct- 

AAof 
Signal 
Pep 


r- 


CO 


OO 


2777 


so 

as 
oo 


1265 


5* NT 
of 

Start 
Codon 


CO 


CO 


oo 


2777 


vo 

o\ 
oo 


«n 
so 


3' NT 

of 
Clone 
Seq. 


CO 




3752 


2995 




1830 


5" NT 

of 
Clone 
Seq. 






3465 


2738 


a\ 

vo 
VO 


1234 


Total 
NT 
Seq, 


CO 


»o 


3752 


2995 


1144 


1830 






CO 
CM 


CM 




oo 

CM 


On 
CM 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


tsl 
'S 


Uni-ZAP XR 


ATCC 
Deposit 
No:Z 
and Date 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 

UJ/ 1 J/7/ 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 
uj/i j/y/ 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 
Uj/ Ijiy I 


cDNA 
Clone ID 


HGBGK76 


vo 

O 
PQ 
O 


oo' 

m 


HBMUW78 


HASAS24 


HSIDN55 


Gene 
No. 


vo 


vo 






OO 


ON 



SDOaCh <WO_983944aA2J_> 



wo 98/39448 



148 



PCT/US98/04493 



1^ o| 






CN 
ON 


r- 


rs 

CO 


O 
CO 


Predicted 
First AA 
or 

Secreted 
Portion 


m 


m 




CO 
CN 


vO 
CS 


VO 
CS 




CO 


CO 


VO 


CS 
CN 


cs 


*o 

CN 


















CN 
CO 


ro 
m 




CO 


to 

CO 

Tj- 


VO 
CO 


5' NT 
of 

Fircf 

AAof 
Signal 
Pep 


1578 


VO 




1127 


CN 

VO 

o\ 


CN 


5' NT 
of 

Start 
Codon 


1578 


VO 




1127 


cs 

VO 

o\ 


CN 


3'NT 

of 
Clone 
Seq. 


1741 


1214 


1128 

1 


1986 


1632 


1565 


5' NT 

of 
Clone 
Seq. 


11505 




oo 


CO 
»o 
oo 


o 

VO 


oo 

CN 


Total 
NT 
Seq. 


1864 


2041 


1990 


2012 


6991 


1565 




o 

CO 


CO 


cs 




CO 
CO 


CO 


Vector 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ai 
X 

< 

13 


pBluescript 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97902 
02/26/97 
209048 

Uj/lj/y/ 


97902 
02/26/97 
209048 

UD/ 1 j/y / 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 

VJ/ 1 J/7 / 


97902 
02/26/97 
209048 


cDNA 
Clone ID 


HGBGZ64 


VO 

> 

m 

VO 


H6EBJ64 


HOECP43 


H2CBV31 


HPCAD23 


Gene 
No. 


o 

cs 


CN 


CS 


cs 
cs 


CO 
CN 


Ti- 
cs 



wo 98/39448 



149 



PCT/US98/04493 





o\ 

SO 




CO 




CO 
CO 


CO 

\n 


Predicted 
First AA 

01 

Secreted 
Portion 


O 




CO 


CO 




NO 

cs 




OS 

CO 




o 

CO 


o 

CO 


oo 


to 

CM 


















CO 


OO 
CO 


o\ 

CO 


oo 
r- 


o 


5 


5" NT 
of 

AAof 
Signa] 
Pep 


1124 




oo 


VO 


1183 


oo 


5' NT 

ill 

of 

Stan 
Codon 


1124 


§ 


oo 


VO 
cs 


1183 


OO • 

to 


3' NT 

of 
Clone 
Seq. 


2007 


081 1 


1906 


2436 




1347 


5' NT 

of 
Clone 
Seq, 


1101 






CM 

to 


i 


CM 


Total 
NT 
Seq. 


2007 


1291 


1906 


VO 
CO 


1935 


VO 




CO 


so 

CO 


CO 




OO 
CO 
1— « 


On 
CO 


Vector 


pSportl 


Uni-ZAPXR 


Lambda ZAP 

n 


Lambda ZAP 
II 


pSportl 


X 


ATCC 
Deposit 
No:Z 
and Date 


97902 
02/26/97 
209048 

yjji ijfy / 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 
nVl 5/97 


97902 
02/26/97 
209048 
05/15/97 


97902 
02/26/97 
209048 
05/1 5/97 


97902 
02/26/97 
209048 
ns/i s/97 

uj/ ijiy 1 


cDNA 
Clone ID 


HSPAG15 


CO 

o 


HUSHH48 


HUSHH48 


ON 

D 
< 


VO 

> 
U 
oo 

33 


Gene 
No. 




so 




r- 


oo 

cs 


ON 

cs 



SDOCID: <WO_9839448A5_l_> 



wo 98/39448 



150 



PCT/US98/04493 





S 




OS 
DO 




m 


On 


Predicted 
First AA 

of 
Secreted 
Portion 






rn 


(N 

cn 


<o 

CN 










O 


_i 
m 






















*— « 
















cs 






ON 

CO 




5 


5' NT 
of 

rirsi 

AAof 

Signal 

Pep 


so 


0\ 








o 
m 


of 

Start 
Codon 




OS 








so 
o 
m 


3' NT 

of 
Clone 
Seq. 


6011 


ON 


OS 
SO 
<N 


oo 
t-- 


1262 


I AS I 


5' NT 

of 
Clone 
Seq. 


OS 
CTl 
NO 


Os 




oo 
o 




OS 


Total 
NT 
Seq. 


1109 


r- 

OS 


OS 
VO 
CN 


CN 

oo 


1269 


1944 




o 




<M 


CN 


cn 






a: 

X 


X 




X 


X 




a: 
X 


Vectoi 


-ZAP 


-ZAP 




-ZAP 


-ZAP 




-ZAP 




"S 


Uni 




'S 


'5 




'c 


ATCC 
Deposit 
No:Z 
aind Date 


97902 
02/26/97 
209048 


9790i 
02/26/97 
209048 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 


cDNA 
Clone ID 


1 


HEBGA37 


HEBFU93 


HEBFU93 


O 

NO 

U 
CO 

O 

00 

as 


HPMGD24 


Gene 
No. 


o 


CO 


CN 
CO 

•—4 


CN 


cn 
cn 


cn 



wo 98/39448 PCr/US98/04493 

151 





OO 


o 


On 

ON 


o 
m 


oo 
to 

CN 




Predicted 
First AA 
or 

Secreted 
Portion 




CN 


cn 


CN 


cn 


CN 




o 
CO 


cn 

CN 


o 
cn 


VO 
CM 


O 
cn 


cn 

CN 




















O 
OO 

to 


oo 


oo 
to 


ON 


CN 
OO 

to 


5' NT 
of 

Firct 

AAof 
Signal 
Pep 




to 
to 


VO 


CN 

cn 


to 

VO 


o 

VO 


5' NT 
of 

Start 
Cddon 




to 
to 


VO 


CN 

cn 


to 
VO 


O 
VO 


3' NT 

of 
Clone 
Seq. 


1021 


VO 
On 


1285 

i 


oo 

CN 
CN 


CM 


1208 


5* NT 

of 
Clone 
Seq. 


VO 
CN 

to 


CN 

to 


to 


CJN 


Ov 
VO 


ON 
VO 


Total 
NT 
Seq. 


1021 


ON 


1285 


1228 


VO 
oo 
cn 


1327 




*o 


OO 
CN 


VO 


ON 
CN 


2; 


o 

oo 


Vector 


pBluescript 


pBluescript 


pBluescript 


pBluescript 


Uni-Zap XR 


Uni-Zap XR 


ATCC 
Deposit 
No:Z 
and Date 


97902 
02/26/97 
209048 

yjJfiJfy 1 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 
05/1 5/97 


97902 
02/26/97 
209048 

05/1 


97902 
02/26/97 

209048 
05/15/97 


97902 

02/26/97 

209048 

05/1 5/97 
yjji u/y 1 


cDNA 
Clone ID 


HPTVC60 


HPTVC60 


HSKNE18 


HSKNEI8 


*o 

a: 


HMWIF35 


Gene 
No. 


to 

CO 


to 

cn 


VO 

cn 


VO 
cn 


cn 


cn 



ISOOCID: <WO_983944SA2J_> 



wo 98/39448 PCT/US98/04493 

152 





oo 


^n 

CO 




<N 

»n 


CO 

oo 

CO 


ON 
CO 


Predicted 
First AA 

01 

Secreted 
Portion 




CO 


CO 


CN 
<N 


CO 


CO 
CO 




oo 


" 1 

CO 


CM 


CN 


CO 


CM 
CO 






























o 




CO 

oo 


CN 


CO 


oo 


T'Nt 
of 

AAof 
Signal 
Pep 


oo 




CO 
CS 




oo 


ON 

CN 


5' NT 
Of 

Start 
Codon 


oo 


CN 






oo 


^ 

ON 
CN 


3' NT 

of 
Clone 
Seq. 


2044 


1847 


a\ 

ON 


1517 


1540 


2196 

j 


5' NT 

of 
Clone 
Seq. 


•—4 

CN 


6891 




CO 


oo 
ro 


O 
CN 


Total 
NT 
Seq. 


2098 


1847 


ON 
ON 


1569 


1540 

1 


2196 

1 




OO 


ON 


00 
CM 


o 




CN 
OO 
CN 


Vector 


Uni-ZapXR 


1 pBluescript 


pBluescript 


Uni-ZAPXR 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97902 
02/26/97 
209048 

Uj/ 1 J/y / 


97902 
02/26/97 
209048 


97902. 
02/26/97 
209048 
0 VI 5/97 


97902 
02/26/97 
209048 


97902 
02/26/97 
209048 

UJ/ LJiy 1 


97902 
02/26/97 
209048 
05/15/07 


cDNA 
Clone ID 


HMWGI25 


HSKGF03 


HSKGF03 


j HMSKE75 


HCMSH30 


HCMSH30 


Gene 
No. 


OO 

m 


ON 
CO 


ON 
CO 
1 


o 


5 





SOOCID: '«iWO_983944aA2J_> 



wo 98/39448 PCT/US98/04493 

153 





VO 
OO 


CO 

VO 


OS 


so 


UO 

o 


CO 

CM 


Predicted 
First AA 

or 

Secreted 
Portion 


ro 
\n 


r- 
<N 




CS 


r5 


CM 




cs 


VO 






CO 
CM 


CM 








_ 












*o 
to 


»o 
oo 


VO 

*o 


r- 


SO 
OO 

*o 


5' NT 
of 

AAof 
Signal 
Pep 


VO 


to 

o\ 


VO 


o 




OO 

oo 


5' NT 

•/ ill 

of 

Start 
Codon 


VO 


lO 

On 


so 


o 




OO 

oo 


3' NT 

of 
Clone 
Seq. 


1575 


m 
VO 
oo 


1166 


cs 
*o 


2031 


1485 


5' NT 

of 
Clone 
Seq. 


o 

OS 

so 




r- 




OS 
SO 
so 


to 


Total 
NT 
Seq. 


1719 


m 
po 


1185 


1101 


2031 


1634 






m 


CO 
OO 
CM 


^o 




oo 


Vector 


pSportl 


pBluescript 


pBluescript 


Uni-ZAPXR 


Lambda ZAP 
II 


Lambda ZAP 
II 


ATCC 

Jjeposit 
No:Z 
and Date 


97902 
02/26/97 
209048 
xj/y I 


97902 
02/26/97 
209048 

05/15/07 


97902 
02/26/97 
209048 
05/15/97 


97902 
02/26/97 
209048 
05/1 S/97 


97903 
02/26/97 
209049 
OS/l 5/Q7 


97903 
02/26/97 
209049 
05/15/97 


cDNA 
Clone ID 


HTWGB92 


HBMDM46 


HBMDM46 


HFAMG13 


HFXHL79 


HFXHL79 


Gene 
No. 




cn 


CO 


5 


to 


to 

2 



SOOCID: <WO_9839448A^L> 



wo 98/39448 



154 



PCT/US98/04493 





o 


s 


12 






CN 

CO 
ro 




Predicted 
First AA 
or 

Secreted 
Portion 




cn 


CO 

CN 


CO 


CO 

CN 


CN 






cs 


CO 

cn 


CN 
(N 


o 

ro 


CN 
CN 


CO 
CN 






















oo 


oo 
»o 


a\ 


oo 
oo 


CJN 
OO 


O 




5' NT 
of 

AAof 
Signal 
Pep 


1592 


1562 


CN 
(N 


CN 


CN 
CN 


ro 


1440 


5' NT 

•/ 111 

of 

Start 
Codon 


1592 


ft 


CN 


CN 
CN 


CN 
CN 


c>l 

CO 


1440 


3' NT 

of 
Clone 
Seq. 


1809 


1749 


ON 


OO * 

oo 


O^ 


1422 


2382 


5' NT 

of 
Clone 
Seq. 


1458 


1458 


»o 








1509 


Total 
NT 
Seq. 


1981 


1795 


wo 

C7N 


oo 
wo 
oo 


uo 
On 


2117 


2395 




\o 


>n 

oo 




VO 
oo 

CN 


r- 

OO 
CN 


OO 


OS 


Vector 


Uni-ZAP XR 


Uni-ZAP XR 


pSportl 


pSportl 


Uni-ZAP XR 


pBluescript 


Lambda ZAP 
II 


ATCC 
Deposit 

and Date 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 

WJ/ / 


97903 
02/26/97 
209049 


209139 
07/03/97 


97903 

209049 
Kjjf iJfy f 


97903 
02/26/97 
209049 

VD/lD/yf 


cDNA 
Clone ID 


HSNAK17 


HSNAKi7 


HCFBC03 


HCFBC03 


HSJAP03 


HSKG026 


HCQAV96 


Gene 
No. 










2; 


oo 

2; 


C3N 



wo 98/39448 



155 



PCT/US98/04493 





2 


OO 
<M 


CM 


o 
oo 


oo 

CO 




Predicted 
First AA 

nf 

or 

Secreted 
Portion 




m 








CO 






O 

CO 




O 
m 


VO 


o 

CO 




1— ^ 














CM 
so 


m 
vo 


O 

ON 




ON 


*n 


5' NT 
of 

Fir«:t 

AAof 

Signal 

Pep 


so 


vo 


1062 


oo 
oo 

CM 


oo 

CM 


1611 


5' NT 
of 
Start 
Codon 


1416 


VO 
Tj- 


1062 


oo 
oo 

CM 


OO 
CM 


1611 


3' NT 

of 
Clone 
Seq. 


2108 


o 
o 

Ov 


1517 


1003 


to 

ON 


2180. 


5' NT 

of 
Clone 
Seq. 


1223 


CM 

oo 


CO 
oo 




cs 


1607 


Total 
NT 
Seq. 


2120 


s 

On 


1517 


1003 


3865 


2196 






VO 


OO 
OO 
CN 


CM 
VO 


On 
OO 
CM 


CO 

^0 




XR 


XR 


XR 


XR 




XR 


ipt 


Vectoj 


Uni-ZAP 


Uni-ZAP 


Uni-ZAP 

j 


Uni-ZAP 




Uni-ZAP 


pBluescr 


ATCG 
Deposit 
No:Z 
and Date 


97903 
02/26/97 
209049 

XJJl I J/ 7 / 


97903 
02/26/97 
209049 

Uj/ iJ/7/ 


97903 
02/26/97 
209049 
05/1 '5/97 

yj^f i^Ji y f 


97903 
02/26/97 
209049 

yjjf iJ/y 1 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 

U j/ 1 jiy 1 


cDNA 
Clone ID 


HSHCC16 


HTLEF62 


HTLEF62 


HTLAD94 


HTLAD94 


HTSFQ12 


Gene 
No. 


o 


to 






CM 


CO 



ISDCXaO: <WO_9839448A?J_> 



wo 98/39448 PCTAJS98/04493 

156 





VO 
ON 


On 
VO 


Ov 
CO 


oo 
o 

CO 




CO 

CN 


Predicted 
First AA 

of 
Secreted 
Portion 






CO 






CN 
CO 




CO 
VO 


ON 
CO 


o 

CO 


*n 




CO 






























VO 
VO 


ON 

in 


VO 


CO 
ON 

m 


oo 
VO 


ON 

VO 


5' NT 
of 

AAof 
Signal 
Pep 


Ov 
CM 


uo 
»n 

CO 


OO 

»n 

CN 


xn 

CN 

in 


CO 


OO 
CN 


NT 
J 111 

of 

Start 
Codon 


as 
as 


m 
in 
CO 


oo 
in 






OO 
CN 


3" NT 

of 
Clone 
Seq. 


1840 


1818 


2871 


2838 


2221 


1816 


5' NT 

of 
Clone 
Seq. 


CM 


ON 


o\ 
oo 


VO 
oo 


CO 
CO 


1130 


Total 
NT 
Seq. 


1945 


1910 


2933 


NO 

r-- 

CM 
CO 


2243 


1816 






O 

ON 


VO 


ON 
CN 


\o 

VO 


VO 




oi 
X 


X 


ai 
X 


X 


X 


a: 

X 


Vectoi 


i-ZAP 


i-ZAP 


i-ZAP 


i-ZAP 


i-ZAP 


i-ZAP 




B 

D 


B 

ID 


B 

D 


e 
D 


s 


c 

:d 


ATCC 
Deposit 
No:Z 
and Date 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 
05/15/97 


97903 
02/26/97 
209049 
05/1 5/97 


97903 
02/26/97 
209049 
05/1 5/97 


97903 
02/26/97 
209049 

u J/ uty t 


cDNA 
Clone ID 


HE6FL83 


CO 
oo 

VO 
OQ 


HTXFJ55 


HTXFJ55 


HJPCJ76 


HLTED27 


Gene 
No. 




in 


*n 
m 


m 
»n 


VO 

«n 


m 



wo 98/39448 PCTAJS98/04493 

157 







CN 
ON 


CO 


»o 
o 




CO 


Predicted 
First AA 

01 

Secreted 
Portion 




OS 


r- 


VO 










oo 


VO 


lO 




CO 


















ON 


o 




^ 


iO 

ON 

»o 


CO 


5' NT 
of 

Pircf 

AAof 
Signal 
Pep 


1306 


oo 
o 


a\ 


1001 


o 
<o 


1722 


5' NT 

mJ ill 

of 

Start 
Codon 


1306 


oo 
o 


ON 


1001 


o 
»o 


1722 


3' NT 

of 
Clone 
Seq. 


1548 


oo 
t — 


VO 

oo 


1869 


1501 


2100 


5' NT 

of 
Clone 
Seq. 


1098 




VO 


oo 
On 


oo 

CO 


1642 


Total 
NT 
Seq. 


1695 


*o 


o 

a\ 


1883 


1501 


2100 




o\ 


oo 


ON 
NO 


o 


m 

On 




'XT i 

Vector 


Uni-ZAP XR 


pSportl 


pBluescript 


Uni-ZAP XR 


Uni-ZAP XR 


pBluescript 
SK- 


ATCC 
Deposit 
No:Z 
and Date 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 
02/26/97 

209049 
05/15/97 


97903 
02/26/97 
209049 
05/15/97 


97903 
02/26/97 
209049 
05/15/97 


97903 
02/26/97 
209049 
05/15/97 


cDNA 
Clone ID 


HLTED27 


HMKBA64 

1 


HNFIP24 


HCELB21 


HCELB21 


HAWBA28 


j 

1 
1 

Gene 
No. 




OO 

to 


ON 

*o 


o 

VO 


o 

\o 


NO 



SDOaO: <WO_983944aA2_L> 



wo 98/39448 PCT/US98/04493 

158 









cn 




ON 
CN 

cn 


CO 


Predicted 
First AA 

01 

Secreted 
Portion 


cn 




cn 




CN 
CN 






o 
cn 




cn 




CN 






















VO 
OS 




o\ 
»o 


VO 


00 

yn 


5' NT 
of 

AAof 
Signal 
Pep 




cn 


CN 
CN 


VO 
Jv 




m 
o 

CO 


S' NT 
J i> 1 

of 

Start 
Codon 


»o 

VO 


cn 


CN 


VO 




o 
cn 


3' NT 

of 
Clone 
Seq. 


1930 


2683 


1451 


1420 


2972 


oo 

OO 


5' NT 

of 
Clone 
Seq. 


oo 


cn 
oo 


VO 

o\ 


ON 


2197 


CN 

»n 


Total 
NT 
Seq. 


1930 


2683 


1509 


1454 


3173 


oo 
oo 




r~- 




cn 


ON 

CN 


^ 


VO 

ON 

CN 


Vector 


pBluescript 
SK- 


pBluescript 
SK- 


pBluescript 
SK- 


pBluescript 
SK- 


Uni-ZAPXR 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97903 
02/26/97 
209049 

AC/1 </Q7 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 

02/26/97 

209049 
n^/ IS/07 


97903 
02/26/97 
209049 


cDNA 
Clone ID 


HSAAS44 


HSAAS44 


HAFAL73 


HAFAL73 


HSAWF26 


HSAWF26 


Gene 
No. 




CN 
VO 


cn 
VO 


cn 

VO 


VO . 


VO 



wo 98/39448 



159 



PCT/US98/04493 







to 




VO 






Predicted 
First AA 
or 

Secreted 
Portion 




ON 




<s 








CM 


OO 


in 


<N 


c3 




^ ^ 'S S* 


















ON 

On 
to 


OO 


ON 


o 
oo 


8 

VO 


5' NT 
of 

AAof 
Signal 
Pep 


o 

VO 


1473 


oo 
oo 
VO 


CO 

r- 






5' NT 

ill 

of 

Start 
Codon 


o 
\o 


1473 




CO 






3' NT 

of 
Clone 
Seq. 


o 

ON 


2413 


1290 

1 


2290 


ON 

to 


to 
to 


5' NT 

of 
Clone 
Seq. 


CO 


1387 


ON 








Total 
NT 
Seq. 


*— « 
o\ 

ON 


2416 


1290 


2290 


ON 

to 


to 
to 






On 


so 

T— ♦ 


r- 


oo 


oo 

On 


Vector 


Uni-ZAP XR 


Uni-ZAP XR 


Uni-ZAP XR 


pBluescript 
SK- 


X 

< 

'S 


Uni-ZAP XR 


ATCC 
Deposit 
No:Z 
and Date 


97903 
02/26/97 
209049 

Vj/iJ/y 1 


97903 
02/26/97 
209049 

\JJI 1 J/7/ 


97903 
02/26/97 
209049 
OS/ 1^/97 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


97903 
02/26/97 
209049 


cDNA 
Clone ID 


HEAAL31 


cn 

1 


HFKFX55 

1 


H2LA01 1 


to 

ON 

N 
Q 

uu 

Ol. 


HPFDZ95 


Gene 
No. 


VO 


\o 


VO 
VO 


r- 

VO 


oo 

VO 


oo 

VO 



ISOOCID: <WO_9839448A2J_> 



wo 98/39448 



160 



PCTAJS98/04493 





ON 


O 


ro 


DO 


On 
rM 


OO 

CO 


Predicted 
First AA 

01 

Secreted 
Portion 






t — 


cn 


Ti- 
cs 


o 

CO 




O 
m 


ro 
CM 


SO 
CM 


O 

m 


ro 

CS 


ON 






























oo 


o 
<a 


cs 
oo 


00 


CM 
O 
VO 


oo 


yNT 

of 

AAof 
Signal 
Pep 


ON 


cs 

VO 

in 


On 
On 


?5 


8 

CO 




NT 
J ill 

of 

Start 
Codon 


o\ 


CM 
VO 


»n 

ON 
ON 




8 

CO 




3' NT 

of 
Clone 
Seq. 


1352 


1530 


1250 


LLL 


r- 

On 
On 


ON 


5" NT 

of 
Clone 
Seq. 




oo 
m 


oo 
o\ 




s 




Total 
NT 
Seq. 


1509 


1530 


1316 


r- 
r- 


ON 
ON 


ON 




On 


ON 
ON 

cs 


O 
OO 


oo 


o 
o 
CO 


oo 


Vector 


-ZAPXR 


rZAPXR 


)Sportl 


i-ZAPXR 


i-ZAP XR 


i-ZAPXR 




'g 


C 




c 


a 


c 
D 


ATCC 
Deposit 
No:Z 
and Date 


97904 
02/26/97 
209050 

AC/1 c/c\n 


97904 
02/26/97 
209050 

nc/1 C/07 


97904 

02/26/97 

209050 
nc/1 </07 


97904 
02/26/97 
209050 

AC/1 C/07 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 


cDNA 
Clone ID 


HPTTUll 


HPTTUll 


HCFAE79 


HTEDJ34 


HTEDJ34 


VO 

o 

O 
K 


Gene 
No. 


On 

SO 


ON 

VO 


o 






<N 



wo 98/39448 PCT/US98/04493 

161 







VO 
CO 


VO 


CO 
CM 


CN 


o 
o 


Predicted 
First AA 
or 

Secreted 
Portion 




uo 

CN 


•o 


CO 


O 


CO 




o 
cs 


CN 


<o 


o 

CO 




o 

CO 


















oo 


VO 
OO 


CO 

o 

VO 


oo 


s 

VO 


oo 
oo 


5' NT 
of 

All Ol 

AAof 
Signal 
Pep 


40 

*o 


CO 


CO 

CO 
CN 


VO 


o 

VO 


»o 

CN 


5' NT 
of 
Start 
Codon 


m 
*o 


CO 


CO 

CO 

CM 


VO 




»o 

CN 


3' NT 

of 
Clone 
Seq. 


1405 


1596 


2345 


2288 


1946 


1180 


5' NT 

of 
Clone 
Seq. 


VO 


to 


»o 


to 
»o 

CO 


CM 


VO 


Total 
NT 
Seq. 


1405 


1596 


2345 


2293 


2369 


1212 




CO 
oo 


oo 


o 


oo 


CN 
O 
CO 


VO 
oo 


Vector 


Uni-ZAPXR 


pBluescript 
SK- 


pBluescript 
SK- 


a: 

X 

c: 


X 

t 

c 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97904 
02/26/97 
209050 

\jjf i j/y / 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 
05/15/97 


97904 
02/26/97 
209050 

UJ/1 J/7 / 


97904 
02/26/97 
209050 

yjjlLjiyi 


97904 
02/26/97 

209050 
\jjf ijfy I 


cDNA 
Clone ID 


HFTAR26 


H2MBF44 


\ H2MBF44 


CN 

oo 
tQ 


HE8BI92 


HFTBR48 


Gene 
No. 


CO 






KO 


lO 


VO 



ISDOaD: <WO_983944aA2_L> 



wo 98/39448 PCTAJS98/04493 

162 







r% ( 








m 

CN 


Predicted 
First AA 

01 

Secreted 
Portion 










OS 






CO 

cs 






m 


ao 




















o 
<o 


ON 
DO 


O 

<o 


O 
OS 




% 


5'Nf 
of 

Pi ret 
nidi 

AAof 
Signal 
Pep 


ro 

so 
SO 


SO 


DO 


oo 




o 


NT 

J 1^ 1 

of 

Start 
Codon 


CO 
VO 
VO 


so 
SO 




oo 






3' NT 

of 
Clone 
Seq. 


1149 


^ 


1515 


1516 




oo 

VO 


5' NT 

of 
Clone 
Seq. 




OZ.Z, 


OS 


o 

VO 
OS 




r-- 
oo 


Total 
NT 
Seq. 


1181 


1605 


1537 


1516 


1493 


oo 
VO 




o 
m 


OO 


o 

cn 


oo 
oo 


in 
o 


OS 
OO 


Vector 


Uni-ZAPXR 

1 


X 

Oh 

c 


Uni-ZAPXR 


Uni-ZAPXR 


a: 
X 

< 

c 


Uni-ZAPXR 


ATCC 
Deposit 
No:Z 
and Date 


97904 
02/26/97 
209050 

AC /I C/OT 


97904 
02/26/97 
209050 

uj/ ij/y / 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 

UJ/ 1 j/y / 


97904 
02/26/97 
209050 


cDNA 
Clone ID 


oo 

S 


S 
u 

OS 

a 
ac 


HE9CM64 


HATAV51 


HATAV51 


HAQAF27 


Gene 
No, 


VO 


r- 




OO 

r 


OO 

•h4 


OS 



JDOCID: <WO__983944aA2J_;^ 



wo 98/39448 PCT/US98/04493 

163 







VO 


ON 


CM 
CO 
CS 


CO 


CO 

C3N 


Predicted 
First AA 

01 

Secreted 
Portion 


CO 




CO 


CO 


CO 
CO 


'"^ 
CO 




o 

CO 




o 

CO 


o 

CO 


CN 
CO 


CO 
CO 




f— t 














<N 


oo 
o 

VO 


CO 
ON 

Tj- 


o\ 
o 

so 


o 


o\ 


5' NT 
of 

AAof 
Signal 
Pep 


o 

CO 




1153 


cs 


CM 

o 

CO 




5' NT 

J 111 

of 
Start 
Codon 


o 

so 
CO 




1153 




CM 

o 

CO 




3' NT 

of 
Clone 
Seq. 


1014 


r- 


2630 


2860 


VO 
00 


1923 


5' NT 

of 
Clone 
Seq. 


CO 

o 




2207 


CO 
VO 


cs 


o 

CO 


Total 
NT 
Seq. 


1014 


r- 
wo 


2779 


2860 


VO 
r- 
oo 


1923 

! 




o 


\o 
o 

ro 


ON 




oo 
o 

CO 


CM 


Vector. 


Uni-ZAP XR 


Uni-ZAP XR 


pBluescript 
SK- 


pBluescript 
SK- 


pBluescript 
SK- 


Uni-ZAP XR 


ATCC 
Deposit 
No;Z 
and Date 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 

Uj/1 j/7/ 


97904 
02/26/97 
209050 

UJ/ I J/7 ( 


97904 
02/26/97 
209050 


97904 
02/26/97 
209050 

n^/l S/Q7 


97904 
02/26/97 
209050 
yjji iJiy 1 


cDNA 
Clone ID 


HCEEK08 


HCEEK08 


HAFAU18 


HAFAU18 


HAFAU18 


HETBY74 


Gene 
No. 


o 

oo 


o 

oo 


OO 


OO 


OO 


CM 
OO 



SOOC^ID: <WO Q83S44aA2 I > 



wo 98/39448 



164 



PCTAJS98/04493 





o 






ON 




ON 
CN 


Predicted 
First AA 

01 

Secreted 
Portion 


m 


ON 






CO 






O 
m 


OO 






o 

CO 


oo 


















<o 


VO 


VO 
0\ 


VO 


r- 


CO 

VO 


5' NT 
of 

AAof 
Signal 
Pep 


oo 


On 




2131 


ON 


o 


5' NT 

^ 111 

of 

Start 
Codon 


oo 


ON 


CO 




o\ 

<N 




3' NT 

of 
Clone 
Seq. 


2286 


2025 


3054 


VO 

o 

CO 


o 

ON 




5" NT 

of 
Clone 
Seq. 


0911 


o 
oo 


2004 


1966 


CN 


VO 


Total 
NT 
Seq. 


VO 

m 
cs 


2025 


3054 


3026 


xr^ 
O 

ON 


CS 

r- 




CO 
On 


ON 

o 
m 


? 


o 
cn 


ON 


CO 




XR 




XR 


XR 


XR 


XR 


XR 


Vectoi 


i-ZAP 




Uni-ZAP 


i-ZAP 


i-ZAP 


-ZAP 


-ZAP 




Uni 




Uni 


B 


Uni 


Uni 


ATCC 

Deposit 
No:Z 
and Date 


97904 
02/26/97 
209050 


05/15/97 


97904 
02/26/97 
209050 

up/ ij/yi 


97904 
02/26/97 
209050 
05/15/97 


97904 
02/26/97 
209050 
05/15/97 


97904 
02/26/97 

209050 
05/15/97 


97904 
02/26/97 
209050 
05/1 5/97 


cDNA 
Clone ID 


HTOAF35 


HTOAF35 


HCRBX32 


HCRBX32 


HEBGB80 


HEBGB80 


Gene 
No. 


m 
oo 


CO 
OO 


OO 


OO 


oo 


oo 



5DOCID: <WO_983944eA2J_> 



wo 98/39448 PCT/US98/04493 

165 







w 
CO 


Predicted 
First AA 
or 

Secreted 
Portion 




ON 




o 
m 


oo 














oo 


2; 

NO 


5' NT 
of 

AAof 
Signal 
Pep 


CN 

cs 


ON 


5' NT 
J ill 

of 

Start 
Codon 


\r\ 
cs 
<N 


r- 

CM 

o\ 


3' NT 

of 
Clone 
Seq. 


ON 

o 
oo 


1289 


5' NT 

of 
Clone 
Seq. 


op 


oo 


Total 
NT 
Seq. 


1290 


1289 






CN 

m 


Vector 


i-ZAPXR 


i-ZAPXR 




c 




ATCC 
Deposit 
No:Z 
and Date 


97904 
02/26/97 
209050 

U 0/ 1 J/ 7 / 


97904 
02/26/97 
209050 








cDNA 
Clone IE 


HFAMH^ 


HFAMH1 


Gene 
No. 


vo 
oo 


oo 



ISDOCID: <WO_9839448A2J_> 



wo 98/39448 PCTAJS98/04493 

166 

Table I summarizes the infomiation corresponding to each "Gene No." 
described above. The nucleotide sequence identified as "NT SEQ ID NO:X" was 
assembled from partially homologous ("overlapping") sequences obtained from the 
"cDNA clone DD" identified in Table 1 and, in some cases, from additional related DNA 

5 clones. The overlapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overlapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:X. 

The cDN A Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain 

10 multiple different clones corresponding to the same gene. "Vector" refers to the type of 
vector contained in the cDNA Clone ID. 

'Total NT Seq." refers to the total number of nucleotides in the contig identified 
by "Gene No." The deposited clone may contain all or most of these sequences, 
reflected by the nucleotide position indicated as "5' NT of Clone Seq." and the "3* NT 

1 5 of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the 
putative start codon (methionine) is identified as "5' NT of Start Codon." Similarly , 
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as 
"5' NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is identified 

20 as "AA SEQ ID NO: Y," although other reading frames can also be easily translated 
using known molecular biology techniques. The polypeptides produced by these 
alternative open reading frames are specifically contemplated by the present invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted 

25 first amino acid position of SEQ ID NO: Y of the secreted portion is identified as 

"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID 
NOiY of the last amino acid in the open reading frame is identified as "Last AA of 
ORF." 

SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently accurate and 
30 otherwise suitable for a variety of uses well known in the art and described further 

below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization 
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the cDN A 
contained in the deposited clone. These probes will £dso hybridize to nucleic acid 
molecules in biological samples, thereby enabling a variety of forensic and diagnostic 
35 methods of the invention. Similarly, polypeptides identified from SEQ ED NO: Y may 
be used to generate antibodies which bind specifically to the secreted proteins encoded 
by the cDNA clones identified in Table 1. 
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Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 
5 sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion 
in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 

10 sequence or the amino acid sequence, the present invention provides not only the 

generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated 
amino acid sequence identified as SEQ ID NO: Y, but also a sample of plasmid DNA 
containing a human cDNA of the invention deposited with the ATCC, as set forth in 
Table 1. The nucleotide sequence of each deposited clone can readily be determined by 

15 sequencing the deposited clone in accordance with known methods. The predicted 
amino acid sequence can then be verified from such deposits. Moreover, the amino 
acid sequence of the protein encoded by a particular clone can also be directly 
determined by peptide sequencing or by expressing the protein in a suitable host cell 
containing the deposited human cDNA, collecting the protein, and determining its 

20 sequence. 

The present invention also relates to the genes corresponding to SEQ ID NO:X, 
SEQ ID NO: Y, or the deposited clone. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. 
Such methods include preparing probes or primers from the disclosed sequence and 
25 identifying or amplifying the corresponding gene from appropriate sources of genomic 
material. 

Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from the 
sequences provided herein and screening a suitable nucleic acid source for the desired 
30 homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well 
35 understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). 
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It is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 
5 The polypeptides of the present invention are preferably provided in an isolated 

form, and preferably are substantially purified. A recombinantly produced version of a 
polypeptide, including the secreted polypeptide, can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). 
Polypeptides of the invention also can be purified from natural or recombinant sources 
10 using antibodies of the invention raised against the secreted protein in methods which 
are well known in the art. 

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the 

15 cleavage point for that sequence, are available. For instance, the method of McGeoch, 
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged 
region and a subsequent uncharged region of the complete (uncleaved) protein. The 
method of von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the information 
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1 

20 indicates the amino terminus of the secreted protein. The accuracy of predicting the 

cleavage points of known manunalian secretory proteins for each of these methods is in 
the range of 75-80%. (von Heinje, supra.) However, the two methods do not always 
produce the same predicted cleavage point(s) for a given protein. 

In the present case, the deduced amino acid sequence of the secreted polypeptide 

25 was analyzed by a computer program called SignalP (Henrik Nielsen et al.. Protein 
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on 
the amino acid sequence. As part of this computational prediction of localization, the 
methods of McGeoch and von Heinje are incorporated. The analysis of the amino acid 
sequences of the secreted proteins described herein by this program provided the results 

30 shown in Table 1. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., + 

35 or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in 
some cases, cleavage of the signal sequence from a secreted protein is not entirely 
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uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present invention. 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the naturally 
5 occurring signal sequence may be further upstream from the predicted signal sequence. 
However, it is likely that the predicted signal sequence will be capable of directing the 
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 

10 Polynucleotide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical 
to the polynucleotide or polypeptide of the present invention. 

15 "Identity" per se has an art-recognized meaning and can be calculated using 

published techniques. (See, e.g.: (COMPUTATIONAL MOLECULAR BIOLOGY, 
Lesk. A.M., ed., Oxford University Press, New York, (1988); BIOCOMPUTING: 
INFORMATICS AND GENOME PROJECTS, Smith, D.W., ed.. Academic Press, 
New York, (1993); COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, 

20 Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, (1994); 

SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, von Heinje, G., Academic 
Press, (1987); and SEQUENCE ANALYSIS PRIMER, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, (1991).) While there exists a number of 
methods to measure identity between two polynucleotide or polypeptide sequences, the 

25 term "identity" is well known to skilled artisans. (Carillo, H., and Lipton, D., SIAM J 
Applied Math 48:1073 (1988).) Methods commonly employed to determine identity or 
similarity between two sequences include, but are not limited to, those disclosed in 
"Guide to Huge Computers," Martin J. Bishop, ed.. Academic Press, San Diego, 
(1994), and Carillo, H., and Lipton. D., SIAM J Applied Math 48:1073 (1988). 

30 Methods for aligning polynucleotides or polypeptides are codified in computer 

programs, including the GCG program package (Etevereux, L, et al.. Nucleic Acids 
Research (1984) 12(1):387 (1984)), BLASTP, BLASTN, FASTA (Atschul, S.F. et 
al., J. Molec. Biol. 215:403 (1990), Bestfit program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 

35 575 Science Drive, Madison, WI 5371 1 (using the local homology algorithm of Smith 
and Waterman, Advances in Applied Mathematics 2:482-489 (1981).) 
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When using any of the sequence alignment programs to determine whether a 
particular sequence is, for instance, 95% identical to a reference sequence, the 
parameters are set so that the percentage of identity is calculated over the full length of 
the reference polynucleotide and that gaps in identity of up to 5% of the total number of 
5 nucleotides in the reference polynucleotide are allowed. 

A preferred method for determing the best overall match between a query 
sequence (a sequence of the present invention) and a subject sequence, also referred to 
as a global sequence alignment, can be determined using the FASTDB computer 
program based on the algorithm of Bratlag et al. (Comp. App. Biosci. 6:237-245 
10 (1990).) The term "sequence" includes nucleotide and amino acid sequences. In a 
sequence alignment the query and subject sequences are either both nucleotide 
sequences or both amino acid sequences. The result of said global sequence alignment 
is in percent identity. Preferred parameters used in a FASTDB search of a DNA 
sequence to calculate percent identiy are: Matrix=Unitary, k:-tuple=4. Mismatch 
1 5 Penalty= 1 , Joining Penalty=30, Randomization Group Length=0, and Cutoff Scores 1 , 
Gap Penal ty=5. Gap Size Penalty 0.05, and Window Size=500 or query sequence 
length in nucleotide bases, whichever is shorter. Preferred parameters employed to 
calculate percent identity and sinnilarity of an amino acid alignment are: Matrix=PAM 
150, k-tuple=2. Mismatch Penalty=l, Joining Penal ty=20, Remdomization Group 
20 Length=0, Cutoff Score=l , Gap Penalty =5, Gap Size Penalty=0.05, and Window 
Size=500 or query sequence length in amino acid residues, whichever is shorter. 

As an illustration, a polynucleotide having a nucleotide sequence of at least 95% 
"identity" to a sequence contained in SEQ ID NO:X or the cDNA contained in the 
deposited clone, means that the polynucleotide is identical to a sequence contained in 
25 SEQ ID NO:X or the cDNA except that the polynucleotide sequence may include up to 
five point mutations per each 100 nucleotides of the total length (not just within a given 
100 nucleotide stretch). In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to SEQ ID NO:X or the deposited clone, up to 5% of the 
nucleotides in the sequence contained in SEQ ID NO:X or the cDNA can be deleted, 
30 inserted, or substituted with other nucleotides. These changes may occur anywhere 
throughout the polynucleotide. 

Further embodiments of the present invention include polynucleotides having at 
least 85% identity, more preferably at least 90% identity, and most preferably at least 
95%, 96%, 97%, 98% or 99% identity to a sequence contained in SEQ ID NO:X or the 
35 cDNA contained in the deposited clone. Of course, due to the degeneracy of the genetic 
code, one of ordinary skill in the art will inmiediately recognize that a large number of 
the polynucleotides having at least 85%, 90%, 95%, 96%, 97%. 98%, or 99% identity 
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will encode a polypeptide identical to an amino acid sequence contained in SEQ ID 
NO: Y or the expressed protein produced by the deposited clone. 

Similarly, by a polypeptide having an amino acid sequence having at least, for 
example, 95% "identity" to a reference polypeptide, is intended that the amino acid 
5 sequence of the polypeptide is identical to the reference polypeptide except that the 

polypeptide sequence may include up to five amino acid alterations per each 100 amino 
acids of the total length of the reference polypeptide. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference amino 
acid sequence, up to 5% of the amino acid residues in the reference sequence may be 

10 deleted or substituted with another amino acid, or a number of amino acids up to 5% of 
the total amino acid residues in the reference sequence may be inserted into the reference 
sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere between 
those terminal positions, interspersed either individually among residues in the 

15 reference sequence or in one or more contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at 
least 80% identity, more preferably at least 85% identity, more preferably at least 90% 
identity, and most preferably at least 95%, 96%, 97%, 98% or 99% identity to an 
amino acid sequence contained in SEQ ID NO: Y or the expressed protein produced by 

20 the deposited clone. Preferably, the above polypeptides should exhibit at least one 
biological activity of the protein. 

In a preferred embodiment, polypeptides of the present invention include 
polypeptides having at least 90% similarity, more preferably at least 95% sinailarity, and 
still more preferably at least 96%, 97%, 98%, or 99% similarity to an amino acid 

25 sequence contained in SEQ ID NO: Y or the expressed protein produced by the 
deposited clone. 

The variants may contain alterations in the coding regions, non-coding regions, 
or both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 

30 activities of the encoded polypeptide. Nucleotide variants produced by silent 

substitutions due to the degeneracy of the genetic code are preferred. Moreover, 
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any 
combination are also preferred. Polynucleotide variants can be produced for a variety 
of reasons, e,g., to optimize codon expression for a particular host (change codons in 

35 the human mRNA to those preferred by a bacterial host such as E. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
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organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

5 Using known methods of protein engineering and recombinant DNA 

technology, variants may be generated to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-tenninus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268:. 2984-2988 

10 (1993), reported variant KGF proteins having heparin binding activity even after 

deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the 
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).) 
Moreover, ample evidence demonstrates that variants often retain a biological 

15 activity similar to that of the naturally occurring protein. For example, Gayle and 

coworkers (J. Biol. Chem 268:22105-221 1 1 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 
3,5(X) individual IL-la mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every possible 

20 amino acid position. The investigators found that "[m]ost of the molecule could be 

altered with littie effect on either [binding or biological activity]." (See, Abstract.) In 
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide 
sequences examined, produced a protein that significanUy differed in activity from wild- 
type. 

25 Furthermore, even if deleting one or more amino acids from the N-terminus or 

C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, otiier biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form 

30 are removed from the N-terminus or C-tennfiinus. Whether a particular polypeptide 

lacking N- or C-terminal residues of a protein retains such immunogenic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Thus, the invention further includes polypeptide variants which show 
35 substantial biological activity. Such variants include deletions, insertions, inversions, 
repeats, and substitutions selected according to general rules known in the art so as 
have little effect on activity. For example, guidance concerning how to make 
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phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.. 
Science 247: 1306-13 10 (1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
5 selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
positions are not critical for protein function. Thus, positions tolerating amino acid 

10 substitution could be modified while still maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of 
single alanine mutations at every residue in the molecule) can be used. (Cimningham 

15 and Wells, Science 244:1081-1085 (1989).) The resulting mutant molecules can then 
be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 

20 protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and He; 
replacement of the hydroxyl residues Ser £uid Thr; replacement of the acidic residues 

25 Asp and Glu; replacement of the amide residues Asn and Gin, replacement of the basic 
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp, 
and replacement of the small-sized amino acids Ala, Ser, Thr. Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, 

30 where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substituent group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 

35 acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 
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For example, polypeptide variants containing amino acid substitutions of 
charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's 
5 immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967); 
Robbins et al.. Diabetes 36: 838-845 (1987); Cleland et al., Crit, Rev. Therapeutic 
Drug Carrier Systems 10:307-377 (1993).) 

Polynucleotide and Polypeptide Fragments 

10 In the present invention, a "polynucleotide fragment" refers to a short 

polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt. 
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in 

15 length," for example, is intended to include 20 or more contiguous bases from the 

cDN A sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers 
as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2(X)0 
nucleotides) are preferred. 

20 Moreover, representative examples of polynucleotide fragments of the 

invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 
450, 451-500, 501-550, 551-600, 651-700, and 701 to the end of SEQ ID NO:X or the 
cDNA contained in the deposited clone. In this context "about" includes the particularly 

25 recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either 

terminus or at both termini. Preferably, these fragments encode a polypeptide which 
has biological activity. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the 

30 deposited clone. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20, 21-40, 
41-60, 61-80, 81-100, 102-120, 121-140, 141-160, and 161 to the end of the coding 

35 region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 1 10, 120, 130, 140, or 150 amino acids in length. In this context "about" 
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includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
aniino acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
5 the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1- 
60, can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted 
from the carboxy terminus of the secreted protein or mature form. Furthermore, any 
10 combination of the above amino and carboxy terminus deletions are preferred. 

Similarly, polynucleotide fragments encoding these polypeptide fragments are also 
preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and alpha- 

15 helix forming regions, beta-sheet and beta-sheet-forming regions, turn and tum- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 
forming regions, substrate binding region, and high antigenic index regions. 
Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 

20 specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are eilso contemplated. 

Other preferred fragments are biologically active fragments. Biologically active 
fragments are those exhibiting activity similar, but not necessarily identical, to an 
activity of the polypeptide of the present invention. The biological activity of the 

25 fragments may include an improved desired activity, or a decreased undesirable activity. 

Epitopes & Antibodies 

In the present invention, "epitopes'* refer to polypeptide fragments having 
antigenic or immunogenic activity in an animal, especially in a human. A preferred 

30 embodiment of the present invention relates to a polypeptide fragment comprising an 
epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." In 
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 

35 81:3998-4002 (1983).) 
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Fragments which function as epitopes may be produced by any conventional 
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 
(1985) further described in U.S. Patent No. 4,631,211.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 

5 least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et 
al.. Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies according to 

10 methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et al., 
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et 
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes 
the secreted protein. The immunogenic epitopes may be presented together with a 
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if 

15 it is long enough (at least about 25 amino acids), without a carrier. However, 

immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be 
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a 
denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 

20 meant to include intact molecules as well as antibody fragments (such as, for example. 
Fab and F(ab*)2 fragments) which are capable of specifically binding to protein. Fab 
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from 
the circulation, and may have less non-specific tissue binding than an intact antibody. 
(Wahl et al., J. Nucl, Med. 24:316-325 (1983).) Thus, these fragments are preferred, 

25 as well as the products of a FAB or other immunoglobulin expression library. 

Moreover, antibodies of the present invention include chimeric, single chain, and 
humanized antibodies. 

Fusion Proteins 

30 Any polypeptide of the present invention can be used to generate fusion 

proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second protein 
by binding to the polypeptide. Moreover, because secreted proteins target cellular 

35 locations based on trafficking signals, the polypeptides of the present invention can be 
used as targeting molecules once fused to other proteins. 
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Examples of domains that can be fused to polypeptides of the present invention 
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through 
linker sequences. 

5 Moreover, fusion proteins may also be engineered to improve characteristics of 

the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence during purification from the host cell or 
subsequent handling and storage. Also, peptide moieties may be added to the 

10 polypeptide to facilitate purification. Such regions may be removed prior to final 

preparation of the polypeptide. The addition of peptide moieties to facilitate handling of 
polypeptides are familiar and routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 

15 immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains of 
mammalian immunoglobulins. (EP A 394,827; Traunecker et al.. Nature 33 1 :84-86 

20 (1988).) Fusion proteins having disulfide-linked dimeric structures (due to the IgG) 
can also be more efficient in binding and neutralizing other molecules, than the 
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 

25 proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 

30 would be desired. For example, the Fc portion may hinder therapy and diagnosis if the 
fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D. 
Bennett et ah, J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 

35 Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
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preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, 
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311), among others, many of which aire commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
5 instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al.. Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the claimed invention. 

10 

Vectors, Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
15 vector. Retroviral vectors may be replication competent or replication defective. In the 
latter case, viral propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is 
20 a virus, it may be packaged in vitro using an appropriate packaging cell line and then 
transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the S V40 early and late promoters and promoters of retroviral LTRs, to 
25 name a few. Other suitable promoters will be known to the skilled artisein. The 

expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
30 UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and teuracycline, kanaraycin or ampicillin resistance 
genes for culturing in E. coli and other bacteria. Representative examples of 
35 appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 

Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 
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293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
5 pNH16a, pNHlSA, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNfEO, pSV2CAT, pOG44. pXTl 
and pSG available from Stratagene; and pSVKB, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

10 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfectioh, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic Methods 
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 

15 present invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 

20 chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC*') is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can also 
be recovered from: products purified from natural sources, including bodily fluids, 

25 tissues and cells, whether directly isolated or cultured; products of chemical synthetic 
procedures; and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and 
mammalian cells. Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated or may be 

30 non-glycosylated. In addition, polypeptides of the invention may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most 

35 proteins also is efficiently removed in most prokaryotes, for some proteins, this 

prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalently linked. 
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Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
5 known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention 
10 can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of somatic 
1 5 cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the SEQ ED NO:X wiU yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can 
20 be achieved with panels of specific chromosome fragments. Other gene mapping 

strategies that can be used include in situ hybridization, prescreening with labeled flow- 
sorted chromosomes, and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
25 fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., 
"Human Chromosomes: a Manual of Basic Techniques," Perg£unon Press, New York 
(1988). 

30 For chromosome mapping, the polynucleotides can be used individually (to 

mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 

35 hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
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analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease. (Disease mapping data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library) .) Assuming 1 megabase mapping resolution and one gene per 
5 20 kb, a cDN A precisely localized to a chromosomal region associated with the disease 
could be one of 50-5(X) potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 

10 translocations, are examined in chromosome spreads or by PGR. If no structural 

alterations exist, the presence of point mutations are ascertained. Mutations observed in 
some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide and 
the corresponding gene from several normal individuals is required to distinguish the 

15 mutation from a polymorphism. If a new polymorphism is identified, this polymorphic 
polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 

20 chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred 

25 polynucleotides are usually 20 to 40 bases in length and complementary to either the 
region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids 
Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al.. Science 
251:1360 (1991) ) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene Expression, CRC 

30 Press, Boca Raton, FL (1988).) Triple helix formation optimally results in a shut-off 
of RNA transcription from DNA, while antisense RNA hybridization blocks translation 
of an mRNA molecule into polypeptide. Both techniques are effective in model 
systems, and the information disclosed herein can be used to design antisense or triple 
helix polynucleotides in an effort to treat disease. 

35 Polynucleotides of the present invention are also useful in gene therapy. One 

goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
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present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 

5 biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RPLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying 
personnel. This method does not suffer from the current limitations of "Dog Tags" 

10 which can be lost, switched, or stolen, making positive identification difficult. The 
polynucleotides of the present invention can be used as additional DNA markers for 
RFLP. 

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 

15 individual's genome. These sequences can be used to prepare PGR primers for 

amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once an unique ID database is established for an individual, 
positive identification of that individual, living or dead, can be made from extremely 

20 small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PGR. hi one prior art technique, gene sequences amplified from 

25 polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to 

identify individuals. (Erhch, H., PGR Technology, Freeman and Go. (1992).) Once 
these specific polymorphic loci are amplified, they are digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with 
DNA corresponding to the DQa class n HLA gene. Similarly, polynucleotides of the 

30 present invention can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of 
unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from the sequences of the present 

35 invention. Panels of such reagents can identify tissue by species and/or by organ type. 
In a similar fashion, these reagents can be used to screen tissue cultures for 
contamination. 



aXXJlD: <WO .9839448A?_I_> 



wo 98/39448 PCT/US98/04493 

183 

In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making oligomers 
5 for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using 
DNA immunization techniques, and as an antigen to elicit an inunune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
10 following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et 
al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087- 
15 3096 (1987).) Other antibody-based methods useful for detecting protein gene 

expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELISA) and the radioinmiunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C). sulfur (35S), tritium (3H), indium (1 12In), and 
20 technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
25 radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

30 A protein-specific antibody or antibody fragment which has been labeled with 

an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 
1 12In, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 

35 subject and the imaging system used will determine the quantity of imaging moiety 

needed to produce diagnostic images. In the case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 20 
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millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. In 
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 
5 Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson 
Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves 
(a) assaying the expression of a polypeptide of the present invention in cells or body 
fluid of an individual; (b) comparing the level of gene expression with a standard gene 

10 expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 

15 supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 

20 response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also 
be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such 

25 as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention could be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from a 

30 recombinant cell, as a way of assessing transformation of the host cell. Moreover, the 
polypeptides of tiie present invention can be used to test the following biological 
activities. 

Biological Activities 

35 The polynucleotides and polypeptides of the present invention can be used in 

assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
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may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

5 A polypeptide or polynucleotide of the present invention may be useful in 

treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 

10 from pluripotent stem cells. The etiology of these inunune deficiencies or disorders 

may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide 
of the present invention can be used as a marker or detector of a particular immune 
system disease or disorder. 

15 A polynucleotide or polypeptide of the present invention may be useful in 

treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or 
polynucleotide of the present invention could be used to increase differentiation and 
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to 
treat those disorders associated with a decrease in certain (or many) types hematopoietic 

20 cells. Examples of immunologic deficiency syndromes include, but are not limited to: 
blood protein disorders (e.g. agammaglobulinemia, dysgammaglobulinemia), ataxia 
telangiectasia, common variable inununodeficiency, Digeorge Syndrome, HTV 
infection, HTLV-BLV infection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 

25 (SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot 
formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to Ureat blood 

30 coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 

disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can 
decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 

35 (infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
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inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an inunune response leading to the destruction of the 
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
5 differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invendon include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
10 glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erytheniatosus, Autoimmune Pulmonary Inflaiimiation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and autoimmune 
15 inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polypeptide or 
polynucleotide of the present invention. Moreover, these molecules can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 
20 A polynucleotide or polypeptide of the present invention may also be used to 

treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
inunune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted inunune cells destroy the host tissues. The 
25 administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of T- 
cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be 
used to modulate inflammation. For example, the polypeptide or polynucleotide may 
30 inhibit the proliferation and differentiation of cells involved in an inflammatory 

response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflanunation associated with infection (e.g., septic 
shock, sepsis, or systemic inflanunatory response syndrome (SIRS)), ischemia- 
reperfusion injury, endotoxin lethality, arthritis, complement-mediated hjrperacute 
35 rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 

disease, Crohn*s disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) 
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Hvperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present 
5 invention may inhibit the proliferation of the disorder through direct or indirect 

interactions. Alternatively, a polypeptide or polynucleotide of the present invention 
may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an inmiune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
10 or mobilizing T-cells, hyperproliferative disorders can be treated. This inmiune 

response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an immune response may 
also be a method of treating hyperproliferative disorders, such as a chemotherapeutic 
agent. 

15 Examples of hyperproliferative disorders that can be treated or detected by a 

polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 

20 pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: hyperganmiaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 

25 Syndrome, Waldenstron's Macroglobulinemia, Gaucher' s Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

Infectious Disease 

30 A polypeptide or polynucleotide of the present invention can be used to treat or 

detect infectious agents. For example, by increasing the inunune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
may be treated. The immune response may be increased by either enhancing an existing 
inmiune response, or by initiating a new inunune response. Alternatively, the 

35 polypeptide or polynucleotide of the present invention may also directly inhibit the 
infectious agent, without necessarily eliciting an immune response. 
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Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
5 Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as. Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g.. Influenza), Papovaviridae, Parvoviridae, 
Picomaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 

10 Rotavirus), Retroviridae (HTLV-I, HTLV-Il, Lentivirus), and Togaviridae (e.g., 
Rubivims). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchioUitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 

15 pneumonia, Burkitt's Lymphoma, chickenpox , hemorrhagic fever. Measles, Mumps, 
Parainfluenza, Rabies, the conmion cold. Polio, leukemia. Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

20 Similarly, bacterial or fungal agents that can cause disease or symptoms and that 

can be treated or detected by a polynucleotide or polypeptide of the present invention 
include, but not limited to, the following Gram-Negative and Gram-positive bacterial 
families and fiingi: Actinomycetales (e.g., Corynebacterium, Mycobacterium, 
Norcardia), Aspergillosis, Bacillaceae (e.g.. Anthrax, Clostridium), Bacteroidaceae, 

25 Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 

Coccidioidomycosis, Cryptococcosis, Dermatocycoses, Enterobacteriaceae (Klebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neissei^iaceae (e.g., Acinetobacter, 
Gonorrhea, Menigococcal), Pasteurellacea Infections (e.g., Actinobacillus, 

30 Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, 

and Staphylococcal. These bacterial or fungal families can cause the following diseases 
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
related infections), paronychia, prosthesis-related infections, Reiter's Disease, 

35 respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning, 
Typhoid, pneumonia. Gonorrhea, meningitis. Chlamydia, Syphilis, Diphtheria, 
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Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 
impetigo. Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases 
(e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound infections. 
A polypeptide or polynucleotide of the present invention can be used to treat or detect 
5 any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis, 

10 Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas. 

These parasites can cause a variety of diseases or symptoms, including, but not limited 
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery, 
giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS related). 
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide 

15 of the present invention can be used to treat or detect any of these symptoms or 
diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a polynucleotide 
20 of the present invention, and returning the engineered cells to the patient (ex vivo 

therapy). Moreover, the polypeptide or polynucleotide of the present invention can be 
used as an antigen in a vaccine to raise an irrunune response against infectious disease. 

Regeneration 

25 A polynucleotide or polypeptide of the present invention can be used to 

differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, bums, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 

30 disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion 
injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and 

35 skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs 
without or decreased scarring. Regeneration also may include angiogenesis. 
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Moreover, a polynucleotide or polypeptide of the present invention may increase 
regeneration of tissues difficult to heal. For example, increased tendon/ligament 
regeneration would quicken recovery time after damage. A polynucleotide or 
polypeptide of the present invention could also be used prophylactically in an effort to 
5 avoid damage. Specific diseases that could be treated include of tendinitis, carpal tunnel 
syndrome, and other tendon or ligament defects. A further example of tissue 
regeneration of non-healing wounds includes pressure ulcers, ulcers associated with 
vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 

10 polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases tiiat could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 

15 neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 

20 

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis 
activity. A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 

25 cells) to a particular site in the body, such as inflanmiation, infection, or site of 

hyperproliferation. The mobilized cells can then fight off and/or heal the particular 
trauma or abnormzility. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of psuticular cells. These chemotactic molecules can then be used to 

30 treat inflanmiation, infection, hyperproliferative disorders, or any inmiune system 

disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of the 
present invention can also attract fibroblasts, which can be used to treat wounds. 

35 It is also contemplated that a polynucleotide or polypeptide of the present 

invention may inhibit chemotactic activity. These molecules could also be used to treat 
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disorders. Thus, a polynucleotide or polypeptide of the present invention could be used 
as an inhibitor of chemotaxis. 

Binding Activity 

5 A polypeptide of the present invention may be used to screen for molecules that 

bind to the polypeptide or for molecules to which the polypeptide binds. The binding 
of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors),or 

10 small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et al.. Current Protocols in Immunology 
l(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 

15 receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 

20 membrane. Preferred cells include cells from mammals, yeast, Drosophila, or £. colL 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially containing 
the molecule to observe binding, stimulation, or inhibition of activity of either the 
polypeptide or the molecule. 

25 The assay may simply test binding of a candidate compound to the polypeptide, 

wherein binding is detected by a label, or in an assay involving competition with a 
labeled competitor. Further, the assay may test whether the candidate compound results 
in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 

30 polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a 
standard. 

35 Preferably, an ELIS A assay can measure polypeptide level or activity in a 

sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
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antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
5 particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptide from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
10 candidate binding compound with a polypeptide of the invention; and (b) determining if 
binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with 
a polypeptide of the invention, (b) assaying a biological activity , and (b) determining if 
a biological activity of the polypeptide has been altered. 

15 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

20 A polypeptide or polynucleotide of the present invention may also be used to 

modulate mammalian characteristics, such as body height, weight, hair color, eye color, 
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 
used to modulate mammalian metabolism affecting catabolism, anabolism, processing, 

25 utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to change 
a mammal's mental state or physical state by influencing biorhythms, caricadic 
rhythms, depression (including depressive disorders), tendency for violence, tolerance 
for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity), 

30 hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive 
qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactprs or other nutritional 
35 components. 
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Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of 
5 SEQ ID NO;X wherein X is any integer as defined in Table 1 . 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Clone Sequence and ending with the nucleotide at about the position of the 3' 
10 Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of 
1 5 the Clone Sequence as defined for SEQ ID NO:X in Table 1 . 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5' 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide 
20 at about the position of the 3* Nucleotide of the Clone Sequence as defined for SEQ ID 
NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a sequence of at least about 150 contiguous 

nucleotides in the nucleotide sequence of SEQ ID NO:X. 
25 Further preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a sequence of at least about 500 contiguous 

nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 

nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
30 ED NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of the 

First Amino Acid of the Signal Peptide and ending with the nucleotide at about the 

position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in 

Table 1. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
35 a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence of SEQ ID NO:X. 
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Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions 
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or 
5 of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1. for said 
1 0 cDN A Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 , which DNA molecule is contained in the deposit given the 
15 ATCC Deposit Number shown in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
20 sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDN A clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to sequence of at least 500 
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone. 
25 A further preferred embodiment is an isolated nucleic acid molecule comprising 

a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample 
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
30 to a sequence of at least 50 contiguous nucleotides in a sequence selected from the 

group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer 
as defined in Table 1 ; and a nucleotide sequence encoded by a human cDN A clone 
identified by a cDN A Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1; which method 
35 comprises a step of comparing a nucleotide sequence of at least one nucleic acid 

molecule in said sample with a sequence selected from said group and determining 
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whether the sequence of said nucleic acid molecule in said sample is at least 95% 
identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 

5 molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined 
from a nucleic acid molecule in said sample with said sequence selected from said 
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 

10 A further preferred embodiment is a method for identifying the Sf)ecies, tissue or 

cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any 

15 integer as defined in Table 1; and a nucleotide sequence encoded by a human cDNA 

clone identified by a cDN A Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDNA clone in Table 1, 

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide 

20 sequence in a panel of at least two nucleotide sequences, wherein at least one sequence 
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 

25 identified in Table 1 , which method comprises a step of detecting in a biological sample 
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide 
sequence that is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence 
of SEQ ID NO:X wherein X is any integer as defined in Table 1; and a nucleotide 

30 sequence encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 

35 two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 



SOOOD: <WO_9839448A^I_> 



wo 98/39448 



196 



PCT/US98/04493 



Also preferred is a comp)Osition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is 
at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 

5 selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein 
X is any integer as defined in Table 1; and a nucleotide sequence encoded by a human 
cDN A clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. . 

10 Also preferred is an isolated polypeptide comprising an amino acid sequence at 

least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of positions 

15 beginning with the residue at about the position of the First Amino Acid of the Secreted 
Portion and ending with the residue at about the Last Amino Acid of the Open Reading 
Frame as set forth for SEQ ID NO: Y in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 

20 amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous an^no acids in the 
amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 

25 at least 95% identical to the complete amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
complete amino acid sequence of a secreted protein encoded by a human cDN A clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 

30 ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 
contained in the deposit with the ATCC Deposit Number shown for said cDN A clone in 

35 Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
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amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDNA clone m Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
5 least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
10 least 95% identical to the amino acid sequence of the secreted portion of the protein 

encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table 1. 

Further preferred is an isolated antibody which binds specifically to a 

15 polypeptide comprising an amino acid sequence that is at least 90% identical to a 

sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO; Y wherein Y is any integer as 
defined in Table 1; and a complete amino acid sequence of a protein encoded by a 
human cDN A clone identified by a cDNA Clone Identifier in Table 1 and contained in 

20 the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: an 
amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1 ; 

25 and a complete amino acid sequence of a protein encoded by a human cDNA clone 

identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table 1 ; which method 
comprises a step of comparing an amino acid sequence of at least one polypeptide 
molecule in said sample with a sequence selected from said group and determining 

30 whether the sequence of said polypeptide molecule in said sample is at least 90% 
identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of 

35 polypeptides in said sample to an antibody which binds specifically to a polypeptide 

comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an 
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amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; 
and a complete amino acid sequence of a protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 
5 Also preferred is the above method wherein said step of comparing sequences is 

performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting p>olypeptide molecules in 

10 said sample, if any, comprising an amino acid sequence that is at least 90% identical to 
a sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1; and a complete amino acid sequence of a secreted protein encoded 
by a human cDN A clone identified by a cDNA Clone Identifier in Table 1 and contained 

1 5 in the deposit with the ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Also preferred is the above method for identifying the species, tissue or cell type 
of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 

20 sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table 1, which method comprises a step of detecting in a biological sample 

25 obtained from said subject polypeptide molecules comprising an amino acid sequence in 
a panel of at least two amino acid sequences, wherein at least one sequence in said panel 
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a 
sequence selected from the group consisting of: an amino acid sequence of SEQ ID 
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid 

30 sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table 1. 

In any of these methods, the step of detecting said polypeptide molecules 
includes using an antibody. 

35 Also preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least 
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90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of; an amino acid sequence of SEQ ID NO:Y wherein Y is 
any integer as defined in Table 1 ; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 
1 and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide 
in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino acid 
sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1. 

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecule into a vector. Also preferred is 
the recombinant vector produced by this method. Also preferred is a method of making 
a recombinant host cell comprising introducing the vector into a host cell, as well as the 
recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such that said polypeptide is 
expressed and recovering said polypeptide. Also preferred is this method of making an 
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said 
polypeptide is a secreted portion of a human secreted protein comprising an amino acid 
sequence selected from the group consisting of: an amino acid sequence of SEQ ID 
NO:Y beginning with the residue at the position of the First Amino Acid of the Secreted 
Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and said position 
of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y is defined in Table 1 ; 
and an amino acid sequence of a secreted portion of a protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDN A clone in Table 1 . The 
isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated 
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polypeptide, polynucleotide, or antibody of the claimed invention effective to increase 
the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
5 illustration and are not intended as limiting. 



Examples 

Example 1; Isolation of a Selected cDNA Clone From the Deposited 
Sample 

10 Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. 

Table 1 identifies the vectors used to construct the cDN A library from which each clone 
was isolated. In many cases, the vector used to construct the library is a phage vector 
from which a plasmid has been excised. The table immediately below correlates the 
related plasmid for each phage vector used in constructing the cDNA library. For 
1 5 example, where a particular clone is identified in Table 1 as being isolated in the vector 
"Lambda Zap," the corresponding deposited clone is in "pBluescript." 

Vector Used to Construct Library Corresponding Deposited Plasmid 

Lambda Zap pBluescript (pBS) 

Uni-Zap XR pBluescript (pBS) 

20 Zap Express pBK 

lafinid BA plafmid B A 

pSportl pSportl 
pCMVSport 2.0 pCMVSport 2.0 

pCMVSport 3.0 pCMVSport 3.0 

25 pCR®2.1 pCR®2.1 

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al.. Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
30 17:9494 (1989)) and pBK (Alting-Mees, M. A. et al.. Strategies 5:58-61 (1992)) are 
conmiercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey Pines 
Road, La JoUa, CA, 92037. pBS contains an ampicillin resistance gene and pBK 
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1 
Blue, also available from Stratagene. pBS comes in 4 forms SK-h, SK-, KS+ and KS. 
35 The S and K refers to the orientation of the polylinker to the T7 and T3 primer 

sequences which flank the polylinker region ("S" is for Sad and "K" is for Kpnl which 
are the first sites on each respective end of the linker). or refer to the orientation 
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of the fl origin of replication (*'ori"), such that in one orientation, single stranded rescue 
initiated from the f 1 ori generates sense strand DNA and in the other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. AU Sport vectors 
5 contain an ampicillin resistance gene and may be transformed into E. coli strain 

DHIOB, also available from Life Technologies. (See, for instance, Gruber, C. E„ et 
al.. Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, NY) 
contains an ampicillin resistance gene and can be U-ansformed into E. coli strain XL-1 
Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue, 

10 Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed 
into E. coli su-ain DHIOB, available from Life Technologies. (See, for instance, Clark, 
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead. D. et al., Bio/Technology 9: 
(1991).) Preferably, a polynucleotide of the present invention does not comprise the 
phage vector sequences identified for the particular clone in Table 1, as well as the 

15 corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number cited 
in Table 1 for any given cDNA clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing 
the same ATCC Deposit Number contain at least a plasmid for each cDNA clone 

20 identified in Table 1. Typically, each ATCC deposit sample cited in Table 1 comprises 
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each 
containing a different cDNA clone; but such a deposit sample may include plasmids for 
more or less than 50 cDNA clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited 

25 sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly 

isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID 
NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 

30 The oligonucleotide is labeled, for instance, with ^^P-y-ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring. NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 

35 those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 1.5% agar plates (containing the appropriate selection 
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agent, e.g., ampicillin) to a density of about 150 transfomiants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
5 1.104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5* NT and the 
3* NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain reaction 
10 is carried out under routine conditions, for instance, in 25 \il of reaction mixture with 
0.5 ug of the above cDN A template. A convenient reaction mixture is 1 .5-5 mM 
MgClj, 0.01% (w/v) gelatin, 20 ^M each of dATP. dCTP, dGTP, dTTP, 25 pmol of 
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation 

at 94'*C for 1 min; annealing at 55''C for 1 min; elongation at 72°C for 1 min) are 

15 performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product 
is analyzed by agarose gel electrophoresis and the DNA band with expected molecular 
weight is excised and purified. The PCR product is verified to be the selected sequence 
by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5* or 3* non-coding 

20 portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well known 
in the art. For instance, a method similar to 5* RACE is available for generating the 
missing 5* end of a desired full-length transcript. (Fromont-Racine et al.. Nucleic Acids 

25 Res. 2 1(7): 1683" 1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population 
of RNA presumably containing full-length gene RNA transcripts. A primer set 
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to 
a known sequence of the gene of interest is used to PCR amplify the 5' portion of the 

30 desired full-length gene. This amplified product may then be sequenced and used to 
generate the full length gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 

35 RNA which may interfere with the later RNA ligase step. The phosphatase should then 
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to 
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remove the cap structure present at the 5* ends of messenger RNAs. This reaction 
leaves a 5* phosphate group at the 5' end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
5 synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end iising a primer specific to 
the ligated RNA oligonucleotide and a primer specific to the known sequence of the 
gene of interest. The resultant product is then sequenced and analyzed to confirm that 
the 5* end sequence belongs to the desired gene. 

10 

Example 2; Isolation of Genomic Clones Corresponding to a 
Polynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X., 
15 according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 

20 among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with P^^ using the rediprime™ DNA labeling system 
(Amersham Life Science), according to manufacturer's instmctions. After labeling, the 
probe is purified using CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT 1200-1. The purified labeled probe is 

25 then used to examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe 
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
protocol number PTl 190-1 . Following hybridization and washing, the blots are 

30 mounted and exposed to film at -TC'C overnight, and the films developed according to 

standard procedures. 



Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
35 end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
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conditions : 30 seconds, 95^C\ 1 minute, 56°C; 1 minute, 70**C. This cycle is repeated 

32 times followed by one 5 minute cycle at lO'^C, Human, mouse, and hamster DNA 

is used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions is analyzed on 
5 either 8% poly aery lamide gels or 3.5 % agarose gels. Chromosome mapping is 

determined by the presence of an approximately 100 bp PCR fragment in the particular 
somatic cell hybrid. 

Example S; Bacterial Expression of a Polypeptide 

10 A polynucleotide encoding a polypeptide of the present invention is amplified 

using PCR oligonucleotide primers corresponding to the 5* and 3* ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and Xbal, at the 5* end of the primers in order to clone the amplified product 

15 into the expression vector. For example, BamHI and Xbal correspond to the restriction 
enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., Chatsworth, 

CA). This plasmid vector encodes antibiotic resistance (AmpO» a bacterial origin of 
replication (on), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site 
(RBS), a 6-histidine tag (6-His), and restriction enzyme cloning sites. 
20 The pQE-9 vector is digested with BamHI and Xbal and the amplified fragment 

is ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial 
RBS. The ligation mixture is then used to transform the E. coli strain M15/rep4 
(Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which expresses 

the lad repressor and also confers kanamycin resistance (KanO. Transformants are 
25 identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 

colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis. 
Clones containing the desired constructs are grown overnight (O/N) in liquid 

culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). 

The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The 
30 cells are grown to an optical density 600 (O.D.^ of between 0.4 and 0.6. IPTG 

(Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 mM. 

IPTG induces by inactivating the lad repressor, clearing the P/O leading to increased 

gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
35 centrifiigation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
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agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4**C. The cell debris is 
removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high 
5 affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, 
the column is first washed with 10 volumes of 6 M guanidine-HCl. pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is eluted with 

10 6 M guanidine-HCl. pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCL Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 

15 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. 
The renaturation should be performed over a period of 1 .5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM immidazole. Immidazole 
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer 
plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80° C. 

20 In addition to the above expression vector, the present invention further includes 

an expression vector comprising phage operator and promoter elements operatively 
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession 
Number XXXXXX.) This vector contains: 1) a neomycinphosphotransferase gene as 
a selection marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 

25 4) two lac operator sequences, 5) a Shine-Delgamo sequence, and 6) the lactose operon 
repressor gene (laclq). The origin of replication (oriC) is derived from pUC19 (LTI, 
Gaithersburg, MD). The promoter sequence and operator sequences are made 
synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
30 Xbal, BamHI, Xhol, or Asp718, running the restricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA 
insert is generated according to the PGR protocol described in Example 1, using PGR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (3* primer). The PGR insert is gel purified and restricted with compatible 
35 enzymes. The insert and vector are ligated according to standard protocols. 



ISOOaO: <WO_983944aA?_l_> 



wo 98/39448 



PCTAJS98/04493 



206 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 6: Purification of a Polypeptide from an Inclusion Body 

5 The following alternative method can be used to purify a polypeptide expressed 

in E coli when it is present in the form of inclusion bodies. Unless otherwise specified, 

all of the following steps are conducted at 4-10'*C. 

Upon completion of the production phase of the E. coli fermentation, the cell 

culture is cooled to 4-10''C and the cells harvested by continuous centrifugation at 

10 15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 
mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a 
high shear mixer. 

15 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 nun. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

20 The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 

pellet is discarded and the polypeptide containing supernatant is incubated at 4^C 

overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
25 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 

vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing 

for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 

30 filtration unit equipped with 0. 16 jim membrane filter with appropriate surface area 

(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
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Stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored. 
Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes 
of water. The diluted sample is then loaded onto a previously prepared set of tandem 
5 columns of strong anion (Poros HQ-50i Perseptive Biosystems) and weak anion 

(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 
10 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant 
monitoring of the effluent. Fractions containing the polypeptide (determined, for 
instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands should be observed from 

15 Commassie blue stained 16% SDS-PAGE gel when 5 ^ig of purified protein is loaded. 

The purified protein can also be tested for endotoxin/LPS contamination, and typically 
the LPS content is less than 0.1 ng/ml according to LAL assays. 

Example 7; Cloning and Expression of a Polypeptide in a Baculovirus 
20 Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 
into a baculovirus to express a polypeptide. This expression vector contains the strong 
polyhedrin promoter of the Autographa califomica nuclear polyhedrosis virus 
(AcMNPV) followed by convenient restriction sites such as BamHI, Xba I and 

25 Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For easy selection of recombinant virus, the plasmid contains the 
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter in the 
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The 
inserted genes are flanked on both sides by viral sequences for cell-mediated 

30 homologous recombination with wild-type viral DNA to generate a viable virus that 
express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, as 
long as the construct provides appropriately located signals for transcription, 

35 translation, secretion and the like, including a signal peptide and an in-frame AUG as 
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required. Such vectors are described, for instance, in Luckow et al.. Virology 170:31- 
39 (1989). 

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table 1, 
5 is amplified using the PGR protocol described in Example 1. If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Sunmiers et al., 
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," 
10 Texas Agricultural Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a conmiercially 
available kit ("Geneclean," BIO 101 Inc., La JoUa, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
1 5 optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La JoUa, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB 101 or other suitable E, coli hosts such as XL-1 Blue 
20 (Stratagene Cloning Systems, La JoUa, CA) cells are transformed with the ligation 

mixture and spread on culture plates. Bacteria containing the plasmid are identified by 
digesting DNA from individual colonies and analyzing the digestion product by gel 
electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

25 Five \i% of a plasmid containing the polynucleotide is co-transfected with 1 .0 ^ig 

of a commercially available linearized baculoviras DNA ("BacuIoGold™ baculovirus 
DNA", Pharmingen, San Diego, CA), using the lipofection method described by 
Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ^ig of 
BaculoGold™ virus DNA and 5 |Xg of the plasmid are mixed in a sterile well of a 

30 microtiter plate containing 50 |il of serum-free Grace's medium (Life Technologies 

Inc., Gaithersburg, MD). Afterwards, 10 |J.l Lipofectin plus 90 |il Grace^s medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the transfection 
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 171 1) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is then 

35 incubated for 5 hours at 27° C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added. 
Cultivation is then continued at 27° C for four days. 
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After four days the supernatant is collected and a plaque assay is performed, as 
described by Sununers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gaJ-expressing clones, which produce blue-stained plaques. (A detailed description of a 
5 "plaque assay" of this type can also be found in the user's guide for insect cell culture 
and baculovirology disu^ibuted by Life Technologies Inc., Gaithersburg, page 9-10.) 
After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 2(X) |il of Grace's medium and the 

10 suspension containing the recombinant baculovinis is used to infect Sf9 cells seeded in 
35 mm dishes. Four days later the supematants of these culture dishes are harvested 
and then they are stored at 4° C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with the 

15 recombinant baculovirus containing the polynucleotide at a multiplicity of infection 
("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is 
removed and is replaced with SF9(X) II medium minus methionine and cysteine 
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 ^iCi of ^^S- 
methionine and 5 p,Ci ^^S-cysteine (available from Amersham) are added. The cells are 

20 further incubated for 16 hours and then are harvested by centrifiigation. The proteins in 
the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE 
followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced protein. 

25 Example 8; Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a mammalian cell. 
A typical mammalian expression vector contains a promoter element, which mediates 
the initiation of transcription of mRNA, a protein coding sequence, and signals required 
for the termination of transcription and polyadenylation of the transcript. Additional 

30 elements include enhancers, Kozak sequences and intervening sequences flanked by 
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved 
with the early and late promoters from SV40, the long terminal repeats (LTRs) from 
Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the cytomegalovims 
(CMV). However, cellular elements can also be used (e.g., the human actin promoter). 

35 Suitable expression vectors for use in practicing the present invention include, 

for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden). 
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pRSVcat (ATCC 37152). pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0. and pCM VSport 3.0. Mammalian host cells that could be used 
include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CVl, quail QCl-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
5 cells. 

Altematively, the polypeptide can be expressed in stable cell lines containing the 
polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation 
of the transfected cells. 

1 0 The transfected gene can also be amplified to express large amounts of the 

encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt. F, W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, 
J. L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 

15 Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is 
the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991); 
Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance 
are selected. These cell lines contain the amplified gene(s) integrated into a 

20 chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (CuUen et 

25 al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al.. Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the resuiction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 

30 DHFR gene under control of the S V40 early promoter 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the protocol 

35 outlined in Example 1, If the naturally occurring signal sequence is used to produce the 
secreted protein, the vector does not need a second signal peptide. Altematively, if the 
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naturally occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1 % agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca,). The fragment then is digested 
5 with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. £. coli HBlOl or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
10 pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DUFR gene is used for 
transfection. Five ^ig of the expression plasmid pC6 is cotransfected with 0.5 jig of the 
plasmid pS Vneo using lipofectin (Feigner et al., supra). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 
15 confers resistance to a group of antibiotics including G418. The cells are seeded in 
alpha minus MEM supplemented with 1 mg/ml G41 8. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml G418. 
After about 10-14 days single clones are trypsinized and then seeded in 6- well petri 
20 dishes or 10 mJ flasks using different concentrations of methotrexate (50 nM, 100 nM, 
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 |liM, 2 ^iM, 5 \iM, 10 mM, 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 100 - 
25 200 ^M. Expression of the desired gene product is analyzed, for instance, by SDS- 
PAGE and Westem blot or by reversed phase HPLC analysis. 

Example 9: Protein Fusions 

The polypeptides of the present invention are preferably fused to other proteins. 

30 These fusion proteins can be used for a variety of applications. For example, fusion of 
the present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose 
binding protein facilitates purification. (See Example 5; see also EP A 394,827; 
Traunecker, et al.. Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and 
albumin increases the halflife time in vivo. Nuclear localization signals fused to the 

35 polypeptides of the present invention can target the protein to a specific subcellular 
localization, while covalent heterodimer or homodimers can increase or decrease the 
activity of a fusion protein. Fusion proteins can also create chimeric molecules having 
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more than one function. Finally, fusion proteins can increase solubility and/or stability 
of the fiised protein compared to the non-fused protein. All of the types of fusion 
proteins described above can be made by modifying the following protocol, which 
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in 
5 Example 5, 

Briefly, the human Fc portion of the IgG molecule can be PGR amplified, using 
primers that span the 5' and 3' ends of the sequence described below. These primers 
also should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a mammalian expression vector. 

10 For example, if pC4 (Accession No.209646) is used, the human Fc portion can 

be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PGR protocol described in Example 1 , is ligated into this BamHI site. Note that 

15 the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not 
be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
20 heterologous signal sequence. (See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATGCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCGAGCGTGCC 
CAGCAGCTGAATTCGAGGGTGGACGGTCAGTGTTCCTCTTCGCCCCAAAACC 

25 CAAGGACACCCrCATGATCTCCGGGACTCCTGAGGTCAGATGCGTGGTGGT 
GGAGGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACG 
GCGTGGAGGTGCATAATGGCAAGACAAAGCCGCGGGAGGAGCAGTACAAG 
AGGACGTACCGTGTGGTCAGGGTCGTCACCGTCCTGCACCAGGACTGGCTG 
AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACCGCG 

30 ATCGAGAAAACCATCTGGAAAGCCAAAGGGCAGCGCCGAGAACCACAGGT 
GTAGACCGTGCCCGCATCCGGGGATGAGCTGACCAAGAACCAGGTCAGCCT 
GACCTGCGTGGTCAAAGGCTTCTATGCAAGCGACATCGCCGTGGAGTGGGA 
GAGGAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGG 
ACTGGGACGGCTCGTTCTTGCTCTACAGCAAGCTCACCGTGGACAAGAGCA 

35 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGC 
ACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGC 
GACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1) 
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Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of 
5 the present invention is administered to an animal to induce the production of sera 

containing polyclonal antibodies. In a preferred method, a preparation of the secreted 
protein is prepared and purified to render it substantially free of natural contaminants. 
Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

10 In the most preferred method, the antibodies of the present invention are 

monoclonal antibodies (or protein binding fragments thereoO. Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al.. Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et al., Eur. J. 
Inununol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 

15 Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eaglets medium supplemented with 10% fetal bovine serum (inactivated at 

20 about 56°C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 p.g/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 

25 (SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells 
obtained through such a selection are then assayed to identify clones which secrete 
antibodies capable of binding the polypeptide. 

30 Alternatively, additional antibodies capable of binding to the polypeptide can be 

produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to inmiunize an animal, preferably a 

35 mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
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whose ability to bind to the protein-specific antibody can be blocked by the polypeptide. 
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and 
can be used to immunize an animal to induce formation of further protein-specific 
antibodies. 

5 It will be appreciated that Fab and F(ab')2 and other firagments of the antibodies 

of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 

10 recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
described above. Methods for producing chimeric antibodies are known in the art. 

15 (See, for review, Morrison. Science 229:1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; 
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO 
8702671; Boulianne et ah. Nature 312:643 (1984); Neuberger et al.. Nature 314:268 
(1985).) 

20 

Example 11: Production Of Secreted Protein For High-Throughput 

Screening Assays 

The following protocol produces a supernatant containing a polypeptide to be 

tested. This supernatant can then be used in the Screening Assays described in 
25 Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 

(Img/mi in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a 

working solutioti of 50ug/ml. Add 200 ul of this solution to each well (24 well plates) 

and incubate at RT for 20 minutes. Be sure to distribute the solution over each well 
30 (note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off 

the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered Saline). The 

PBS should remain in the well until just prior to plating the cells and plates may be 

poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10^ cells/well in .5ml 
35 DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 

(12-604F Biowhittaker))/ 10% heat inactivated FBS(14-503F Biowhittaker)/ Ix 

Penstrep( 17-602E Biowhittaker). Let the cells grow overnight. 
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The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5nil Optimem I (31985070 Gibco/BRL)/96-weU plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an expression 
vector containing a polynucleotide insert, produced by the methods described in 
5 Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a 
multi-channel pipetter, add 50ul of the Lipofectamine/Optimenri I mixture to each well. 
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20 
minutes, use a multi-channel pipetter to add 150ul Optimem I to each well. As a 
control, one plate of vector DNA lacking an insert should be transfected with each set of 
10 transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates of 
cells, and then person B rinses each well with .5-lml PBS. Person A then aspirates off 
15 PBS rinse, and person B, using al2-channel pipetter with tips on every other channel, 
adds the 200ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to 
the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in DMEM 
^ with Ix penstrep, or CHO-5 media (see below) with 2mm glutamine and Ix penstiep. 
20 (BSA (81-068-3 Bayer) lOOgm dissolved in IL DMEM for a 10% BSA stock 

solution). Filter the media and collect 50 ul for endotoxin assay in 15ml polystyrene 
conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of 
the incubation period. Person A aspirates off the U*ansfection media, while person B 

25 adds 1 .5ml appropriate media to each well. Incubate at ST'C for 45 or 72 hours 
depending on the media used: 1 %BS A for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supematants from 
each well can then be used in the assays described in Examples 13-20. 

30 It is specifically understood that when activity is obtained in any of the assays 

described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supernatant characterized by an 

35 activity in a particular assay. 
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HGS-CHO-5 medium formulation: 
Inorganic Salts 



CaC12 (anhyd) 


116.6 mg/L 


CuS0^-5H,0 


0.00130 


Fe(N0,),-9H,0 


0.050 


FeS0^-7H,0 


0.417 


KCI 


311.80 


MgCI, 


28.64 


MgSO, 


48.84 


NaCl 


6995.50 


NaHCO, 


2400.0 


NaH,PO,-H,0 


62.50 


Na,HP04 


71.02 


ZnS04-7H,0 


.4320 



5 Lipids 



Arachidonic Acid 


.002 mg/L 


Cholesterol 


1.022 


DL-aipha- 
Tocopherol-Acetate 


.070 


Linoleic Acid 


0.0520 


Linolenic Acid 


0.010 


Myristic Acid 


0.010 


Oleic Acid 


0.010 


Palmitric Acid 


0.010 


Palmitic Acid 


0.010 


Pluronic F-68 


100 


Stearic Acid 


0.010 


Tween 80 


2.20 


Carbon Source 


1 D-Glucose 1 4551 mg/L | 


Amino Acids 


L- Alanine 


130.85 mg/ml 


L-Arginine-HCL 


147.50 


L-Asparagine-H,0 


7.50 


L-Aspartic Acid 


6.65 


L-Cystine-2HCL- 
H,0 


29.56 


L-Cystine-2HCL 


31.29 


L-Glutamic Acid 


7.35 


L-Glutamine 


365.0 


Glycine 


18.75 


L-Histidine-HCL- 


52.48 
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H,0 




L-Isoleucine 


106.97 


L-Leucine 


111.45 


L-Lysine HCL 


163.75 


L-Methionine 


32.34 


L-Phenylalainine 


68.48 


L-Proline 


40.0 


L-Serine 


26:25 


L-Threonine 


101.05 


L-Tryptophan 


19.22 


L-Tryrosine-2Na- 
2H,0 


91.79 


L-Valine 


99.65 



Vitamins 



Biotin 


0.0035 mg/L 


D-Ca Pantothenate 


3.24 


Choline Chloride 


11.78 


Folic Acid 


4.65 


i-Inositol 


15.60 


Niacinamide 


3.02 


Pyridoxal HCL 


3.00 


Pyridoxine HCL 


0.031 


Riboflavin 


0.319 


Thiamine HCL 


3.17 


Thymidine 


0.365 


Vitamin B„ 


0.680 



Other Components 



HEPES Buffer 


25 mM 


Na Hypoxanthine 


2.39 mg/L 


Lipoic Acid 


0.105 


Sodium Putrescine-2HCL 


0.081 


Sodium Pyravate 


55.0 


Sodium Selenite 


0.0067 


Ethanolamine 


20uM 


Ferric Citrate 


0.122 


Methyl-B-Cyclodextrin complexed with 
Linoleic Acid 


41.70 


Methyl-B-Cyclodextrin complexed with 
Oleic Acid 


33.33 


Methyl-B-Cyclodextrin complexed with 
Retinal Acetate 


10 



5 



Adjust osmolarity to 327 mOsm 
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Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs 
pathway bind to gamma activation site "GAS" elements or interferon-sensitive 

5 responsive element ("ISRE"), located in the promoter of many genes. The binding of a . 
protein to these elements alter the expression of the associated gene. 

GAS and IS RE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Stat I and Stat3 are present in many cell types, as is 

10 Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL-12. StatS was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

15 The STATs are activated to translocate from the cytoplasm to the nucleus upon 

tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jakl, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

20 The Jaks are activated by a wide range of receptors summarized in the Table 

below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and 

25 (b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a 

conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 
WSXWS motif (a membrane proxial region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID 
NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
30 activate STATs, which then translocate and bind to GAS elements. This entire process 
is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines are 
35 known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS 
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 
identified. 
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10 



15 



20 



25 



30 



35 



40 



45 



ISRE 




JAKq 






*3 1 r\ 1 O 


w/\oicicmenisi or 












Ligand 


tvk2 


Jakl 


Jak2 


Jak3 






BFN familv 














IFN-a/B 


+ 


+ 


- 


- 


1,2,3 


ISRE 


IFN-g 




+ 


+ 


- 


1 


GAS 


(IRFl>Lys6>IFP) 














IMO 


+ 


7 


7 


- 


1,3 




rolSO familv 














IL-6 (Pleiotrohic) 


+ 






7 


1,3 


GAS 


(IRFl>Lys6>IFP) 












D- 11 (Pleiotrohic) 


7 


+ 


7 


7 


1.3 




OnM(Pleiotrohic) 


7 


+ 


+ 


7 


1.3 




LIF(Pleiotrohic) 


7 


+ 




7 


1.3 




CNTF(Pleiotrohic) 


-/+ 




+ 


7 


1.3 




G-CSF(Pleiotrohic) 


7 


+ 


7 


7 


1.3 




IL-12(Pleiotrohic) 


+ 


- 




+ 


1.3 




e-C familv 














IL-2 (lymphocytes) 


- 


+ 


- 


+ 


1,3,5 


GAS 


IL-4 (lymph/myeloid) 


- 


+ 


- 




6 


GAS (IRFl = IFP 


»Ly6)(IgH) 












IL-7 (lymphocytes) 


- 




-• 


+ 


5 


GAS 






f 






D 




IL-13 (lymphocyte) 




+ 


7 


7 


6 


GAS 


IL-15 


7 


+ 


7 


+ 


5 


GAS 


eo 140 familv 














IL-3 (myeloid) 




- 


+ 


- 


5 


GAS 


(IRFl>IFP»Ly6) 














IL-5 (miyeloid) 




- 


+ 


- 


5 


GAS 


GM-CSF (myeloid) 




- 


+ 


- 


5 


GAS 


Growth hormone familv 












GH 


7 




+ 


- 


5 




PRL 


7 


+/- 


+ 




1.3.5 




EPO 


7 








5 


GAS(B- 


CAS>IRP1 =IFP»Ly6) 










Receptor Tvrosine Kinases 












EGF 


7 


+ 






1.3 


GAS (IRFl) 


PDGF 


7 




+ 




1.3 




CSF-i 


7 




+ 




1.3 


GAS (not IRFl) 
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To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies 
of the GAS binding site found in the IRFl promoter and previously demonstrated to 
5 bind STATs upon induction with a range of cytokines (Rothman et al.. Immunity 

1 :457-468 (1994).), although other GAS or ISRE elements can be used instead. The 5* 
primer also contains 18bp of sequence complementary to the SV40 early promoter 
sequence and is flanked with an Xhol site. The sequence of the 5* primer is: 
5*:GCGCCTCGAGATTTCCCCGAAATCrAGATTTCCCCGAAATGATTTCCCCG 
1 0 AAATG ATTTCCCCG AAATATCTGCC ATCTCAATTAG:3 ' (SEQ ID NO:3) 

The downstream primer is complementary to the S V40 promoter and is flanked 
with a Hind IH site: 5^GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID 
NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
15 the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

5 ' :CrCGAG ATTTCCCCG AAATCT AG ATTTCCCCG AAATG ATTTCCCCG AAAT^ 
20 ATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCC 
CTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCrCCGC 
CCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGC 
CTCrGAGCTATTCCAGAAGTAGTGAGGAGGCirrilTGGAGGCCTAGGCTTT 
TGCAAA AAGCTT :3' (SEQIDNO:5) 

25 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can 

30 be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindlll and 

35 Xhol, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
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neomycin resistance gene, and therefore, is not preferred for mammalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
5 Sail and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

10 Other constructs can be made using the above description and replacing GAS 

with a different promoter sequence. For example, construction of reporter molecules 
containing NJTC-B and EGR promoter sequences are described in Examples 15 and 16. 
However, many other promoters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 

15 substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, II- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

20 Example 13: Hiph-Throughput Screening Assay for T-cell Activity, 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells. T- 
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 

25 signal transduction pathway. The T-cell used in this assay is Jurkat T-cells (ATCC 

Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and 
Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells axe lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 

30 SEAP/neo vector using DMRIE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,0(X) cells per well and transfectants resistant to 1 mg/ml genticin selected. Resistant 
colonies are expanded and then tested for their response to increasing concentrations of 
interferon gamma. The dose response of a selected clone is demonstrated. 

35 Specifically, the following protocol will yield sufficient cells for 75 wells 

containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI 
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+ 10% serum with l%Pen-Strep. Combine 2.5 mis of OFTI-MEM (Life Technologies) 
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMREE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
5 number of cells (10^ per transfection), and resuspend in OPTI-MEM to a final 

concentration of 10^ cells/ml. Then add 1ml of 1 x 10^ cells in OPTI-MEM to T25 flask 
and incubate at 3TC for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with supematants 
10 containing a polypeptide as produced by the protocol described in Example 1 1 . 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supematants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
15 million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul 
of cells into each well (therefore adding 1(X), 000 cells per well). 

After all the plates have been seeded, 50 ul of the supematants are transferred 
20 directly from the 96 well plate containing the supematants into each well using a 12 

channel pipette. In addition, a dose of exogenous interferon gamma (0.1, LO, 10 ng) 
is added to wells H9, HIO, and HI 1 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supematants arc placed in 
25 an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
from each well are then transferred to an opaque 96 well plate using a 12 channel 
pipette. The opaque plates should be covered (using sellophene coyers) and stored at - 
20^C until SEAP assays are performed according to Example 17. The plates 
containing the remaining treated cells are placed at 4^C and serve as a source of material 
30 for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon ganuna can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 
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Example 14: High-Thro ughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity by identifying factors, 
such as grov/th factors and cytokines, that may proliferate or differentiate myeloid cells. 
5 Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in 

Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the 
Jaks-STATS signal transduction pathway. The myeloid cell used in this assay is U937, 
a pre-monocyte cell line, although TF-1, HL60, or KGl can be used. 

To transienUy transfect U937 cells with the GAS/SEAP/Neo construct produced 
10 in Example 12, a DEAE-Dextran method (Kharbanda at. al., 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2xl0e7 U937 cells and wash with 
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
mg/ml streptomycin. 

1 5 Next, suspend the cells in I ml of 20 mM Tris-HCl (pH 7.4) buffer containing 

0.5 mg/ml DEAE-Dextran. 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KCl, 375 uM Na2HP04.7H20. 1 mM MgCl2, and 675 uM CaCla- Incubate at 370C 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
20 resuspend in 10 ml complete medium and incubate at 37^C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting 1x10* cells (this is enough for ten 96-well 
25 plates assay) and wash with PBS. Suspend the cells in 200 ml above described growth 
medium, with a final density of 5x10' cells/ml. Plate 200 ul cells per well in the 96- 
well plate (or IxlC cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 1 1 . 
Incubate at 37^C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma 
30 can be used which is known to activate U937 cells. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 
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Example 15; High»Throuphput Screening Assay Identifying Neuronal 
Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
5 EGRl (early growth response gene 1), is induced in various tissues and cell types upon . 
activation. The promoter of EGRl is responsible for such induction. Using the EGRl 
promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochromocytoma cells) are known to proliferate and/or 

10 differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 

phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGRl gene expression is activated during this treatment. Thus, by stably transfecting 
PC 12 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PC 12 cells can be assessed. 

15 The EGR/SEAP reporter construct can be assembled by the following protocol. 

The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al.. Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using the following primers: 

5* GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3* (SEQ ID NO:6) 
20 5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3* (SEQ ID NO:7) 

Using the GAS:SEAP/Neo vector produced in Example 12, EGRl amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using restriction enzymes Xhol/Hindlll, removing the GAS/SV40 stuffer. Restrict the 
25 EGRl amplified product with these same enzymes. Ligate the vector and the EGRl 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and 

30 allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is done 

35 every three to four days. Cells are removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 
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Transfect the EGR/SEAP/Neo construct into PC 12 using the Lipofectamine 
protocol described in Example 1 1 . EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G4 18. The G418-free medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
5 for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 
10 The next morning, remove the medium and wash the cells with PBS. Scrape 

off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as 5x1 0^ 
cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
15 1x1 05 cells/well). Add 50 ul supernatant produced by Example 11, 37^0 for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold 
induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

20 

Example 16: High-Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety 

of agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40, 
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by 

25 expression of certain viral gene products. As a transcription factor, NF-kB regulates 
the expression of genes involved in immune cell activation, control of apoptosis (NF- 
kB appears to shield cells from apoptosis), B and T-cell development, anti-viral and 
antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 

30 (Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 

causing NF- kB to shuttle to the nucleus, thereby activating transcription of target 

genes. Target genes activated by NF- kB include IL-2, IL-6, GM-CSF, ICAM-1 and 
class 1 MHC. 
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Ehie to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB pronnoter element are used to screen the supematants 
produced in Example 1 1 . Activators or inhibitors of NF-kB would be useful in treating 
diseases. For example, inhibitors of NF-kB could be used to treat those diseases 
5 related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis. 

To construct a vector containing the NF-kB promoter element, a PGR based 

strategy is employed. The upstream primer contains four tandem copies of the NF-kB 

binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 
to the 5' end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
10 S^GCGGCCTCGAGGGGACITTCCCGGGGACTTTCCGGGGACT^ 
TTTCCATCCrGCCATCrCAATTAG:3^ (SEQIDNO:9) 

The downstream primer is complementary to the 3' end of the SV40 promoter 
and is flanked with a Hind III site: 

5^GCGGCAAGCrTTTTGCAAAGCCTAGGC:3* (SEQ ID NO:4) 
15 PGR amplification is performed using the SV40 promoter template present in 

the pB-gal:promoter plasmid obtained from Clontech. The resulting PGR fragment is 
digested with Xhol and Hind HI and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

20 5 ' :CTCGAGGGGACTTTCCCGGGGACTTTCCGGGG ACTTTCCGGGACTTTCC 
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCA 
TCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACT 
AA'rrriU'irrATTTATGCAGAGGCCGAGGCCGCCrCGGCCTCTGAGCTATTC 
CAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTT: 

25 3' (SEQ ID NO: 10) 

Next, replace the SV40 minimal promoter element present in the pSEAP2- 

promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and Hindlll. 

However, this vector does not contain a neomycin resistance gene, and therefore, is not 
30 preferred for manmialian expression systems. 

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP 

cassette is removed from the above NF-kB/SEAP vector using restriction enzymes Sail 
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the 
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NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP 
gene, after restricting pGFP-1 with Sail and Notl. 

Once NF-kB/S V40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 13. Similarly, 
5 the method for assaying supematants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 ng) is added to 
wells H9, H 10, and HI 1, with a 5-10 fold activation typically observed. 

Example 17; Assav for SEAP Activity 
10 As a reporter molecule for the assays described in Examples 13-16, SEAP 

activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 

following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 

Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 \il of 2.5x 
15 dilution buffer into Optiplates containing 35 ^.1 of a supernatant. Seal the plates with a 

plastic sealer and incubate at 65^C for 30 min. Separate the Optiplates to avoid uneven 
heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 |il Assay Buffer and incubate at room 

20 temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 \ll Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at each 
time and start the second set 10 minutes later. 

25 Read the relative light unit in the luminometer. Set H 12 as blank, and print the 

results. An increase in chemiluminescence indicates reporter activity. 



Reaction Buffer Formulation: 

# of plates Rxn buffer diluent (ml) CSPD (ml) 

To 60 3 

1 1 65 3.25 

12 70 3.5 

13 75 3.75 

14 80 4 
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Example 18: High-Throughput Screening Assay Identifying Changes in 
Small Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
5 molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supematants which 
bind to receptors of a particular cell. Although the following protocol describes an 
assay for calcium, this protocol Ccui easily be modified to detect changes in potassium, 
sodium, pH, membrane potential, or any other small molecule which is detectable by a 

10 fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-3, used here. 

15 For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star black 

96- well plate with clear bottom. The plate is incubated in a COj incubator for 20 hours. 
The adherent cells are washed two times in Biotek washer with 200 ul of HBSS 
(Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To 

20 load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is 
incubated at 3T'C in a CO^ incubator for 60 min. The plate is washed four times in the 
Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5x10^ cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 

25 fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 

The tube is then placed in a 37**C water bath for 30-60 min. The cells are washed twice 
with HBSS, resuspended to 1x10* cells/ml, and dispensed into a microplate, 100 
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once 
in Denley Cell Wash with 200 ul, followed by an aspiration step to 100 ul final volume. 

30 For a non-cell based assay, each well contains a fluorescent molecule, such as 

fluo-3. The supernatant is added to the well, and a change in fluorescence is detected. 

To measure the fluorescence of intracellular calcium, the FLEPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and 

35 (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular 
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signaling even which has resulted in an increase in the intracellular Ca"*^ 
concentration. 

Example 19 ; High-Throughput Screening Assay Identifying Tyrosine 
5 Kinase Actiyitv 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase 
RPTK) group are receptors for a range of mitogenic and metabolic growth factors 
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In 

10 addition there are a large family of RPTKs for which the corresponding tigand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 
resulting in transphosphorylation of the receptor subunits and activation of the 

15 cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor 
associated tyrosine kinases of the src-family (e.g., src, yes, Ick, lyn, fyn) and non- 
receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members 
of which mediate signal transduction triggered by the cytokine superfamily of receptors 
(e.g., the Interleukins, Interferons. GM-CSF, and Leptin). 

20 Because of the wide range of known factors capable of stimulating tyrosine 

kinase activity, the identification of novel human secreted proteins capable of activating 
tyrosine kinase signal transduction pathways are of interest. Therefore, the following 
protocol is designed to identify those novel human secreted proteins capable of 
activating the tyrosine kinase signal transduction pathways. 

25 Seed target cells (e.g., primary keratinocytes) at a density of approximately 

25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, BL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 
with 100 ml of cell culture grade type I collagen {50 mg/ml), gelatin (2%) or polylysine 

30 (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO) or 
10% Matrigel purchased from Becton Dickinson (Bedford,MA), or calf serum, rinsed 
with PBS and stored at 4^C. Cell growth on these plates is assayed by seeding 5,(XX) 
cells/well in growth medium and indirect quantitation of cell number through use of 
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, 

35 CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford.MA) are 
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Example 18; High-Throughput Screening Assay Identifying Changes in 
Small Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
5 molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supematants which 
bind to receptors of a particular cell. Although the following protocol describes an 
assay for calcium, this protocol can easily be modified to detect changes in potassium, 
sodium, pH, membrane potential, or any other small molecule which is detectable by a 

10 fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-3, used here. 

15 For adherent cells, seed the cells at 10.000 -20.000 cells/well in a Co-star black 

96-well plate with clear bottom. The plate is incubated in a COj incubator for 20 hours. 
The adherent cells are washed two times in Biotek washer with 200 ul of HBSS 
(Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To 

20 load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each welL The plate is 
incubated at 37°C in a COj incubator for 60 min. The plate is washed four times in the 
Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5x10^ cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 

25 fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 

The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed twice 
with HBSS, resuspended to 1x10* cells/ml, and dispensed into a microplate, 100 
ul/welL The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once 
in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume. 

30 For a non-cell based assay, each well contains a fluorescent molecule, such as 

fluo-3. The supernatant is added to the well, and a change in fluorescence is detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and 

35 (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular 
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signaling even which has resulted in an increase in the intracellular Ca++ 
concentration. 

Example 19 : Hieh-Throughput Screening Assay Identifying Tyrosine 
5 Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase 
RPTK) group are receptors for a range of mitogenic and metabolic growth factors 
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In 

10 addition there are a large family of RPTKs for which the corresponding ligand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 
resulting in transphosphorylation of the receptor subunits and activation of the 

15 cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor 
associated tyrosine kinases of the src-family (e.g., src, yes, Ick, lyn, fyn) and non- 
receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members 
of which mediate signal transduction triggered by the cytokine superfamily of receptors 
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 

20 Because of the wide range of known factors capable of stimulating tyrosine 

kinase activity, the identification of novel human secreted proteins capable of activating 
tyrosine kinase signal transduction pathways are of interest. Therefore, the following 
protocol is designed to identify those novel human secreted proteins capable of 
activating the tyrosine kinase signal transduction pathways, 

25 Seed target cells (e.g., primary keraiinocytes) at a density of approximately 

25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 
with 100 ml of cell culture grade type I collagen <50 mg/ml), gelatin (2%) or polylysine 

30 (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO) or 
10% Matrigel purchased from Becton Dickinson (Bedford,MA), or calf serum, rinsed 

with PBS and stored at 4^0. Cell growth on these plates is assayed by seeding 5,000 
cells/well in growth medium and indirect quantitation of cell number through use of 
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, 
35 CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are 
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used to cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture 
plates can also be used in some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
Loprodyne plates (20,000/200ml/well) and cultured overnight in complete medium. 
5 Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 

minutes treatment with EGF (6{)ng/ml) or 50 ul of the supernatant produced in Example 
1 1 , the medium was removed and 100 ml of extraction buffer ((20 mM HEPES pH 
7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM Na4P207 
and a cocktail of protease inhibitors (# 1836170) obtained from Boeheringer Mannheim 

10 (Indianapolis, IN) is added to each well and the plate is shaken on a rotating shaker for 
5 minutes at 4^C, The plate is then placed in a vacuum transfer manifold and the extract 
filtered through the 0.45 mm membrane bottoms of each well using house vacuum. 
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum 
manifold and inunediately placed on ice. To obtain extracts clarified by centriftigation, 

15 the content of each well, after detergent solubilization for 5 minutes, is removed and 
centrifuged for 15 minutes at 4^C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 
Generally, the tyrosine kinase activity of a supernatant is evaluated by 

20 determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
PSKl (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

25 The tyrosine kinase reaction is set up by adding the following components in 

order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM 
ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM beta-glycerophosphate, ImM EGTA. lOOmM MgCl2, 5 mM MnCl2, 
0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and then 5ul of water. Mix the 

30 components gently and preincubate the reaction mix at 30^C for 2 niin. Initial the 
reaction by adding lOul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120mm 
EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 

35 mixture to a microtiter plate (MTP) module and incubating at 37<^C for 20 min. This 
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allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/ml)) to each well and incubate at 37°C for one hour. Wash the well as 
5 above. 

Next add l{X)ul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELISA reader. The level of bound 
peroxidase activity is quantitated using an ELISA reader and reflects the level of 
10 tyrosine kinase activity. 

Example 20; High-Throughput Screening Assay Identifying 
Phosphorylation Activity 

As a potential alternative and/or compliment to the assay of protein tyrosine 
15 kinase activity described in Example 19, an assay which detects activation 

(phosphorylation) of major intracellular signal transduction intermediates can also be 
used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-l and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
20 Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
substituting these molecules for Erk-1 or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELISA 
plate with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are then 
25 rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT, The protein G plates 
are then treated with 2 commercial monoclonal antibodies (lOOng/well) against Erk-1 
and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of the 

above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4^C 
30 until use. 

A43 1 cells are seeded at 20,(X)0/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supematants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 
35 filtered directly into the assay plate. 
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After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinase (lOng/welJ) is used in place 
of A43 1 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
(lug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and 
5 Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The 
bound polyclonal antibody is then quantitated by successive incubations with 
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac 
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over 
background indicates a phosphorylation. 

10 

Example 21; Method of Determining Alterations in a Gene 
Corresponding to a Polynucleotide 

RNA isolated from entire families or individual padents presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated from 
15 these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is 
then used as a template for PCR, employing primers surrounding regions of interest in 

SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95*'C for 30 

seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at TO'^C, using buffer 

solutions described in Sidransky, D., et al.. Science 252:706 (1991). 

20 PCR products is then sequenced using primers labeled at their 5' end with T4 

p>olynucleotide kinase, employing SequiTherm Polymerase. (Epicentre Technologies). 
The intron-exon borders of selected exons is also determined and genomic PCR 
products analyzed to confirm the results. PCR products harboring suspected mutations 
is then cloned and sequenced to validate the results of the direct sequencing. 

25 PCR products is cloned into T-tailed vectors as described in Holton, T.A, and 

Graham, M.W., Nucleic Acids Research, 19:1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals is identified by 
mutations not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 

30 alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5 - 
triphosphate (Boehringer Manheim), and FISH performed as described in Johnson, 
Cg. et al.. Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe is 
carried out using a vast excess of human cot-1 DNA for specific hybridization to the 

35 corresponding genomic locus. 
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Chromosomes are counterstained with 4,6-diamino-2-phenylidole and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
precise mapping are obtained using a triple-band filter set (Chroma Technology, 
Brattleboro, VT) in combination with a cooled charge-coupled device camera 
5 (Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 
et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
10 translocations. These alterations are used as a diagnostic marker for an associated 
disease. 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

15 A polypeptide of the present invention can be detected in a biological sample, 

and if an increased or decreased level of the polypeptide is detected, this polypeptide is 
a marker for a particular phenotype. Methods of detection are numerous, and thus, it is 
understood that one skilled in the art can modify the following assay to fit their 
particular needs. 

20 For example, antibody-sandwich ELIS As are used to detect soluble 

polypeptides in a sample, preferably a biological sample. Wells of a microtiter plate are 
coated with specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The 
antibodies are either monoclonal or polyclonal and are produced by the method 
described in Example 10. The wells are blocked so that non-specific binding of the 

25 polypeptide to the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used to 
validate results. The plates are then washed three times with deionized or distilled water 
to remove unbounded polypeptide. 

30 Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 

concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 

35 phosphate (NPP) substrate solution to each well and incubate 1 hour at room 

temperature. Measure the reaction by a microtiter plate reader. Prepare a standard 
curve, using serial dilutions of a control sample, and plot polypeptide concentration on 
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the X-axis (log scale) and fluorescence or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard curve. 

Example 23: Formulating a Polypeptide 

5 The secreted polypeptide composition will be formulated and dosed in a fashion 

consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide 
alone), the site of delivery, the method of administration, the scheduling of 
administration, and other factors known to practitioners. The "effective amount" for 

10 purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of secreted 
polypeptide administered parenterally per dose will be in the range of about 1 p.g/kg/day 
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject 
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and 

15 most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If 
given continuously, the secreted polypeptide is typically administered at a dose rate of 
about 1 ^g/kg/hour to about 50 ^tg/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous 
bag solution may also be employed. The length of treatment needed to observe changes 

20 and the interval following treatment for responses to occur appears to vary depending 
on the desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention are 
administered orally, rectally, parenterally, intracistemally, intravaginally, 
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 

25 patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers 
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes 
of administration which include intravenous, intramuscular, intraperitoneal, intrastemal, 
subcutaneous and intraarticular injection and infusion. 

30 The secreted polypeptide is also suitably administered by sustained-release 

systems. Suitable examples of sustained-release compositions include semi-permeable 
polymer matrices in the form of shaped articles, e.g., films, or mirocapsules. 
Sustained-release matrices include polylactides (U.S. Pat. No, 3,773,919, EP 58,481), 
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., 

35 Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl methacrylate) (R. Langer et 
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech. 12:98- 
105 (1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D- (-)-3-hydroxybutyric 
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acid (EP 133,988). Sustained-release compositions also include liposomally entrapped 
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods 
known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692 
(1985); Hwang et al., Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322; 
5 EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. AppL 83-1 18008; 

U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes 
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content 
is greater than about 30 moL percent cholesterol, the selected proportion being adjusted 
for the optimal secreted polypeptide therapy. 

10 For parenteral administration, in one embodiment, the secreted polypeptide is 

formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable 
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations 
employed and is compatible with other ingredients of the formulation. For example, the 

15 formulation preferably does not include oxidizing agents and other compounds that are 
known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then^ if necessary, the product is shaped into the desired formulation. Preferably the 

20 carrier is a parentereil carrier, more preferably a solution that is isotonic with the blood 
of the recipient. Examples of such carrier vehicles include water, saline. Ringer's 
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that 

25 enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, 
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as 
ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g., 
polyarginine or tripeptides; proteins, such as semm albumin, gelatin, or 

30 immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, 
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose, 
manose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates, 

35 poloxamers, or PEG. 

The secreted polypeptide is typically formulated in such vehicles at a 
concentration of about 0. 1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of 



wo 98/39448 



237 



PCT/US98/04493 



about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 
5 0.2 micron membranes). Therapeutic polypeptide compositions generally are placed 
into a container having a sterile access port, for example, an intravenous solution bag or 
vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 

10 formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials 
£ue filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the 
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the 
lyophilized polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or 

15 more containers filled with one or more of the ingredients of the pharmaceutical 

compositions of the invention. Associated with such container(s) can be a notice in the 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In addition, the polypeptides of the 

20 present invention may be employed in conjunction with other therapeutic compounds. 

Example 24: Method of Treating Decreased Levels of the Polypeptide 
It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 

25 administering the polypeptide of the present invention, preferably in the secreted form. 
Thus, the invention also provides a method of treatment of an individual in need of an 
increased level of the polypeptide comprising administering to such an individual a 
pharmaceutical composition comprising an amount of the polypeptide to increase the 
activity level of the polypeptide in such an individual. 

30 For example, a patient with decreased levels of a polypeptide receives a daily 

dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 
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Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 
5 For example, a patient diagnosed with abnomially increased levels of a 

polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 
2.0 and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest period 
if the treatment was well tolerated. The formulation of the antisense polynucleotide is 
provided in Example 23. 

10 

Example 26; Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
15 separated into small pieces. Small chunks of the tissue are placed on a wet surface of a 
tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed to 
the bottom of the flask and fresh media (e.g.. Ham's F12 media, with 10% FBS, 

20 penicillin and streptomycin, is added. The flasks are then incubated at 37°C for 

approximately one week. 

At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks. 

25 pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long 

terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
Hindni and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 

30 using PGR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1. Preferably, the 5* primer contains an EcoRI site and the 3* primer 
includes a Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 

35 appropriate for ligation of the two fragments. The ligation mixture is then used to 
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transform bacteria HB 101, which are then plated onto agar containing kanamycin for 
the puqjose of confirming that the vector has the gene of interest properly inserted. 

The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% 
calf serum (CS), penicillin and streptomycin. The MSV vector containing the gene is 
then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequenUy, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 
titer of virus is high, then virtually all fibroblasts will be infected and no selection is 
required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is being produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. 
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(1) GEa^ERAL INFORMATION: 

(i) APPLICANT: Human Genome Sciences, Inc. et al . 

(ii) TITLE OF INVENTION: 186 Human Secreted Proteins 

(iii) NUMBER OF SEQUENCES: 644 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

15 (B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

20 

(E) COUNTRY: USA 
<F) ZIP: 20850 

25 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

30 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 
35 (D) SOFTWARE: ASCII Text 



40 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: March 6, 1998 
45 (C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

50 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: A. Anders Brookes, Esq. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE /DOCKET NUMBER: PS002 . PCT 

(vi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8439 



20 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

30 

GGGATCCGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCCACCGTGC CCAGCACCTG 60 

AATTCGAGGG TGCACCGTCA GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 120 

35 TCTCCCGGAC TCCTGAGGTC ACATGCGTGG TGGTGGACGT AAGCCACGAA GACCCTGAGG 180 

TCAAGTTCAA CTGGTACGTG GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG 240 

AGGAGCAGTA CAACAGCACG TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT 300 

40 

GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTCCAACAA AGCCCTCCCA ACCCCCATCG 360 

AGAAAACCAT CTCCAAAGCC AAAGGGCAGC CCCGAGAACC ACAGGTGTAC ACCCTGCCCC 420 

45 CATCCCGGGA TGAGCTGACC AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT 480 

ATCCAAGCGA CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 540 

CCACGCCTCC CGTGCTGGAC TCCGACGGCT CCTTCTTCCT CTACAGCAAG CTCACCGTGG 600 

50 

ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC CGTGATGCAT GAGGCTCTGC 660 

ACAACCACTA CACGCAGAAG AGCCTCTCCC TGTCTCCGGG TAAATGAGTG CGACGGCCGC 720 

55 GACTCTAGAG GAT 733 
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(2) INFORMATION FOR SEQ ID NO: 2: 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LEKCTH: 5 amino acids 

<B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

10 

Trp Ser Xaa Trp Ser 
1 5 



15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOUX3Y: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3; 

GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCCG AAATGATTTC 60 

CCCGAAATAT CTGCCATCTC AATTAG 86 

30 



(2) INFORMATION FOR SEQ ID NO: 4: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEtTOTH: 27 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: doiible 

40 (D) TOPOLOGY: linear 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGGCAAGCT TTTTGCAAAG CCTAGGC 27 



(2) INFORMATION FOR SEQ ID NO: 5: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTK^: SEQ ID NO: 5: 
CTCGAGATTT CCCCGAAATC TAGATTTCCC CGAAATGATT TCCCCGAAAT GATTTCCCCG 60 
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AAATATCTGC CATCTCAATT AGTCAGCAAC CATAGTCCCG CCCCTAACTC CX3CCCATCCC 120 
GCCCCTAACT CCGCCCAGTT CCGCXXATTC TCCGCCXXM' GGCTGACTAA TTTTTTTTAT 180 
5 TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT 240 
TTTTGGAGGC CTAGGCTTTT GCAAAAAGCT T 271 



10 



25 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCGCTCGAGG GATGACAGCG ATAGAACCCC GG 32 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotjble 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGAAGCTTC GCGACTCCCC GGATCCGCCT C 31 

40 



(2). INFORMATION FOR SEQ ID NO: 8: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 12 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGACTTTC CC 12 



(2) INFORMATION FOR SEQ ID NO: 9: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) I^EKGTH: 73 base pairs 

(B) TYPE: nucleic acid 
<C) STRANnEDNESS: double 
(D) TOPOIXX3Y: linear 

5 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 9: 
GCGGCCTCGA GGGGACTTTC CCGGGGACTT TCCGGGGACT TTCCGGGACT TTCCATCCTG 60 
10 CCATCTCAAT TAG 73 



15 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 
20 CC) STRANDEDNESS: double 

<D) TOPOUDGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 10: 
. 25 CTCGAGGGGA CTTTCCCGGG GACTTTCCGG GGAdTTCCG GGACTTTCCA TCTGCCATCT 60 
CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 120 
CAGTTCCGCC CATTCTCCGC CCCATGQCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 180 

30 

GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG 240 
CTTTTGCAAA AAGCTT 256 

35 

(2) INFORMATION FOR SEQ ID NO: 11: 

40 (i) SEQUENCE CHARACTERICTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GGCACGAGGT AATTTCTACC AGAAATTTCC AGAGCATTAT GTAGGTAGAA AAAAATGCAA 60 

50 GCAAGCTGTT AAAGATCTTG GATCCCATTA TATAGTATGT ATAGCTGAAA TCTGTAATTC 120 

AATCACTTTT TCTCTTTTAT CCTCTAACCA AAAAATTGTT TAATTTTGCA TCCCAAATGT 180 

TTTTAATCTT TGTATATTTT TTAAAAATCC TTTTCTCCTC ATCATTGCCT TTTTTGTGGT 240 

55 

TGTAAATAGA CTTACTTGCA CTTTGAAGAT GAGTTACTCC TTGTCATCTT ACAAATATGT 300 

GATATGGTAA TTTTCATAAC AGATGTCAGT TTTGAACCAA GAATTGGTGA TTTGrTTATA 360 

60 AGAAAAAAAC TGGCTTCATr TCTGTGAAAT TGCTCTTTGA AAATTTCTTT TTACACGTGT 420 
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AAGCCAACTG AGATACCGTG ATGGTGTTGA TTTCTTTCAA TGATCCTTAC CATCTATTTT 480 
AGCCACTGAG CCTTTTATTA TTTGTCTATT TGTAAAGTTT ATTTGTCTTA ACTCATTTAA 540 
TAAATATACT GTTTATCTGT TTCTGAAAAA AAAAAAAAAA AA 582 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTTTGGGGGT GAGGCCGAGC TGCTGCGGGG CTTCGTCGCC GGCCAGGACA CAGCTACTCG 60 

CACGGCGGCG GCGCCTGGCT ATGATGTTCC TCACCCAGGG CGGGCCTCTG CCCTCTACTC 120 

25 GTGCCAGGCC CACTTGCCAG GCAGGAGCCC TCCCCAAGCC TTCAGGGCTG CTCCSGAGTCA 180 

CCTGTTGGAA TGGACTAAAA GGACCCTTGT GTGGGAACAG GTGCTCCCCA AACACCCTGC 240 

TGCTGGCTGC CAGGCAGGCC CTCTGGAAGG GAAGGGGCAG GACTCATCAG GACCTCCCTC 300 

30 

GACCCCTGCA GGGCAGGCAG CTTGGGCCCG AGCCCAAGCA TTTGGCTCTG CTGCCCCCAA 360 

GGGGACAGGA AGCCTCTTGG GCCTCTTCCC TTCCTGGACA AGGCCCCCTG CCTTTGCCTC 420 

35 ACATAAACTG TACAGTATTT TCATTAAAAG CCTCTTTCAT AAAAA 465 



40 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEENESS: doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

50 ATGCAATTCC TGCTCACAGC CTTTCTGTTG GTGCCACTTC TGGCTCTTTG TGATGTCCCC 60 

ATATCCCTAG GCTTCTCCCC CTCCTAGAAG GGCTTCTTGA TAGATTAGAA AATAAGAATG 120 

AGTGACATTT CCTATGTGCA TATAAGAAGG AGCCACAAGA CATGTCTTTT AAATAAAAGG 180 

ACAGTGTCCA TCCTTTTAGC TGCCGAATAG AACCTTGGTC TCATCCTCCT GGAGCTAGGC 240 

CTTTAAAACA GCTTCTGTGT TTCTCATTTG TCTCAGTGTT TTGCCAGGGT TTTATCGGAA 300 

60 AGATAATGTT* CCGTTTAAAA TATTTCCTAA TGAGGCOGGG CGTGGTGGCT CAOGCCTGTA 360 
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10 



20 



35 



45 



ACCCTAGCAM TTGGGGGCTG AGCX3GGTGGA TCACGAGGTC AGQAGATCGA GACCATCCTG 420 
GSTAACATGG TGAAACCCXX; TCTCTACTAA AAATACAAAA AAAAAAAAAA AAAA 474 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTATGTTGGG GAGCAAGACC TGATAGCCAG CCTTTACATG GGAGTATAAT TCTGTCCTCC 60 

ATCTCATAAG CCCCAGTACC TGAGCCAGAA TGATTATAAC CAACCACACT GTCTCTTTAT 120 

CATGGATGGC TTTAGCAGTA GGTTATTTTC ATCATTGCCA TTTGTAGCTC TACAGTGGTT 180 

25 TATAGTAATT TCTCATCTTT TAAGTCTCTC CCTCAGTGCC TGTTGTTATC AAACTCATTG 240 

CTCTCTCANG CAGTTGAGCT CTGCATTCTC CCYTATGGGG GAGAGCTGTG TTGGAGAGAG 300 

AGAATATNAC TTCC 314 

30 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LH^H: 613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCATATTGC CGTCTGGCTA AAAGTGAACA TGCCATTGAT CAATCTGCTT TTATTATATT 60 

ATGTTCCTAA TGGTGGCAAG CAAGACAAGA AGTAGAAAGA AAGATGGTGT AAGCTCAAGA 120 

ACCCACTAAA TCTATCCTAT GGCCTGGGTT CACCCAGCCT GCTTTGTGGA TTTTGTCTCA 180 

50 CTATAACAGA GCTCCCAAGG AGACTGCAGA GTCAGCTCCC TTAAGCACTG TAACTAAAGC 240 

CTAACTCTTC CGTTCCACCC AACAATGTYC CCAGCTCATC CTCTITCCCR AAGTCCCCTT 300 

TCTGCCCCAG ATGCGAATTG CATTTAACTA ATCCTCAAGT GAAATGTCCA CACAGRATTC 360 

55 

CATTTTAATT AGCATACCAT AGTTTTTGTG CAAATTTGCT TTCAGARGAC TCCCATTGCA 420 

GCTGCTCAGA GACGCTAAWG GCAGGGCCTC TTGAWGCTTT CCCGATAGCT TTCAGCTGCA 480 

60 ATAGCTCTTA GGCAGAATGC CATGAGCGTC CTGCCCAACT GTATTACTQG GGAACACCTG 540 



t 
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ATTGGCTAGA AGTTGATCCT CCTGTAACTT TTCTGAGTTC TTTACATTTA CTCGTGAAAC 600 



CCAAATATGC CAC 



613 



10 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOIiOGY: linear 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCCCCCCCAT TGAACCCTGG GCTGTGAAAG TTTTTGCCTG TGTOGGTCC3T TCTCyiX3TOGC 60 

GCCTGGTGTG TGGKTCCCAA CTCCTGTTGC AAAGTGGCAG CAGCCAATCA TGAAGCGCCC 120 

TTATTOTTAG TTGCAGATGA CCAGGTCTCC CCCCCACAGC CTCTC3TCTGG TCCCTCATTC 180 

25 GTGAGTGGTC TGCCTGCCCA AOGAGCCTGA TTGGTGGGAA ATGGCATCAT CTAATATGAT 240 

GGGAAGGCAT TTGGTCCTGG TTATGTTTAT TACAACATCA TTGCACTCTG GGACTCCAGT 300 

CCCTGAAAAC GTAATTTGTG GTGTTACCAA AGGACCACAG GGGAAAAAAA AAAAAA 356 

30 



35 



45 



55 



<2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAAACTANAT CCCGGGGCTT TTAACNGGTA CTTQGGAAAT AAGTATTGGG TAATCACTAA 60 

GNGGACATTG ACTGCACCAA ACCAAAGCTA TAGAAAGAAA TGATTGACTT TTTAAAATAT 120 

ATTCACATTA ACTGTCCTAG GATACTTCTC TTGAGGCTTT GGAAAACTTC TTCCTTGAAA 180 

50 TTTGCATATC CACTCXy\GTT CTGTCACCAA AGATTTTAAT CTTCAGATCG CAAOTTCCTC 240 

TCTCCCAGAA AAAAGTACTA CAACAGGCTC AAGGGATATG CTTTGGTGGT CAAGGGATTA 300 

CACTATGGTT TTCCTTCTGT TCACAATGGT ATTTACAGGA GACCTTGTCA TCAGAGGACG 360 

TACTGAACTA TCTTTATGAC TTTGGATTTG ATCAGAGGTT TAAAAAAAAA AAAA 414 



60 
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(2) INFORMATION FOR SEQ ID NO: X8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOIOOT: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATCACCATT GCAATACAAA TGATCTGCCT GGTGAATGYT GAGCTGTACC CCACATTCGT 
CAGGAACYTC GGACTGATGG TGTGTTCCTC CCTGTGTGAC ATAGGTQGGA TAATCACCCC 
CTTCATAGTC TTCAGGCTGA GGGAGGTCTG GCAAGCCTTG CCCCTCATTT TGTTTGCCSGT 
GTTGGGCCTG CTTGCCGCGG GAGTGACGCT ACTTCTTCCA GAGACCAAGG GGGTCGCTTT 
GCCAGAGACC ATGAAGGACG CCGAGAACCT TGGGAGAAAA GCAAAGCCCA AAGAAAACAC 
GATTTACCTT AAGGTCCAAA CCTCAGAACC CTCGGGCACC TGAGAGAGAT GTTTTGCGGC 
GATGTCGTGT TGGAGGGATG AAGATGGAGT TATCCTCTGC AGAAATTCCT AGACGCCTTC 
ACTTCTCTGT ATTCTTCCTC ATACTTGCCT ACCCCCAAAT TAATATCAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCCCCCCCCC CCCCCACACT TTCAGGAGTC ACCCCCCAGC ATTTGGGGTT GGGTTCGCCC 
TACTCCAGCC TGGAGCTCCC TGAGGGAGCC TGCACTCCCT GCTCCCAATC CCCGCTACTG 
GTGCAGGGAT GCAGCCTGGA GCTGGCGTCC TTGTTCTGGG CCTGCTGCTG CCGCCACCCC 
AGAGCCCCAG CCTGTCCTQA ATTGACATCA GTGCTTCCCT GAACTGCCTC CCCCACCCCT 
GGGCATTATC CCAGGAAACT TTATGTTTTC TAGAAGCTAA GCAGCTGCTG GGACTCAGGG 
ACTGGTGCAG GTAGGCTGAG TGGCAGCTCA GTCCTAGAAG GTCTCTGAAG ATCTGGACTG 
AGGACCTTGC TACTCCCCAA GCCAGAGCCC ATCAGCCAGG CCTGCTGTGA GCCACCTGCC 
TGTGGAGTGC TGAGCTCAAC CAAAGGCTGG CAAGCTCTGG GCCTCATTTA AGGGATTCTG 
ATGAGCCGAT GGGCCCTGGA GGCAGCCCAT TAAAGCATCT GGCTCGTTTT TGGAAAAAAA 
AAAAAAAAAA 
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10 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOXjOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TCTTGAAGAG TGTACAGTAC AGGATTATTA TAATGAAAGT TTATATCAAC AGGOTITCGT 60 

15 TGGCTCTGCA TATATTATAA GCAAAAGAGA TTGGTAAAGT GCCACAGTAT TCCAGATAAC 120 

TTTTCAGTTG CGGCCTTTCT TCTCGTTCTT TAATTTGAAA CCTAGATACA TCCACTAAAA 180 

ACTAGGAGAA TGACTTTTAC CCTTGGGGAC AGCCAAGTTT TGTTGATAAA CCTATITCCT 240 

AGCATGCCTT CAGGAAGTTG TGCCAGACCC TAGATTGTGA AGGACOCACT GTTCTTCTCT 300 

TGTACGAGCT CCCTGAACCA TTGTTCAGAG GACCAATGTC ACATCGCrrC AOCGGCATOG 360 

25 NCCATGGGAG CATCTGGGTG ATAYCTGTCT ACAGTATTGG CTCTTCTGCG AGGCTGATAC 420 

ACAAGGCCTC TCTTCCACAT GATCATTTGC AAACCTCCCC CAGCCCCTAC CATCCAATGT 480 

GGAAGGAAAA CAAGAACTGC CTGAAGAAGA GTCCAAGCTA CAGATACACA GCGTOTGCAT 540 

TGCGGCTGTC ACCTTCCTCC TCCCACTTCT GTATCCTCAG AGATGCTGCG TGGAaXSOTrC 600 

CTTAACCTCA GCTGACTTCC CTGTGAATGT CTAATGCTAG TTCAGGGCCT CCAGGCATIG 660 

35 atttgtacag tggtaactcc caatgaggct tctgttatca tttggtgtgc TrryrcTGrc 720 

attaaaagaa atgattttcc c 742 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEENESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGCACGAGTC TCCCCTGGGG AAGTTTTTCT TTTTCAGGAG GGAGGAOGGC TTTCCCAGGT 60 

AATGTGTCTA GAGTGTTGGG CAGAAAATCT GGGACCACAC CACACCAGTT CTCTCCTTAA 120 

TCCACGTCAT TTGCCTTCTA TCCCAGCTAT GTTTCCAGTG TCCTCTCGGT GTTTCCAAGA 180 

GCAACAAGAA ATGAATAAAT CTCTGGTGAG TTGTTTATTT GTTCTTCACT TTGTITTACA 240 

CTGTATTTTC TGAGTTTATG GGTGTCTGTG AATTAAAAAG GAAAAGTAGA AATAAGTAAA 300 
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ACTCAGGTTG AAGGAAATAT ACATAAATAA GATAAAGCTG ACCTGTAGAT ATAGCAGGTT 360 

ATAAAGCTTA GAGTTGTCTA AGTTGAGTGC AAATTTTCCT CTGATCTTTC TGATGCXX3AA 420 

5 

CAAAAAAGCA GTCATGTTTG TTAOXSTGATT GGAATGGAAC CCX3AGAAGAG AGCATGCTGT 480 

GTTCTTGrPGG GACAGGAAAG CTTGCGTC5CA CXHAAGTCTGA ACCACCACCT TCATGGTGAC 540 

10 ATAGATTATG TCCTC5GAACA TATTTCACAC CGGCCTGGCA CTAAACACTT GTAGTGTTCT 600 

GCAGTGGAAA CGGTCATCTT CCX3CTAAAGC ACGGCGTCTT GTGCAGCGGA AATGGTCATC 660 

TGCTGCTAAA ACACAGCTTC CATCGTAATG TATGCTCCTT ACTCAAAGAG TGTGGTCCCA 720 

15 

AACAGCCTTT GGGAGGTCCT CCTTGATTCA TGGATGAAAC CTGGAACATC TTGAGGACTG 780 

AGTTAACXrAT AGGTCCTTAA ATAACTCTCC ACACGTTTTT CTTAGTTTAT CTCTACATGC 840 

20 AGGGTGTGCA GCAGCCTGTT CAAAGTCATA TTTTCTGGGA AATATTTCCA GTGTTTATTT 900 

GCACTTTAGC CCACTCTGTG TAGCXrTTATT TCTTCTAAAC ICACCATTAA TCTGAATAAT 960 

AGTCAAATTT AGGGGGACTG TATTTGCCTT A 991 

25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
35 (D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCACGCGTCC GGAATTCCCC TGAGGATCTT GGGCTATCTT TGACAGGGGA TTCTTGCAAG 60 

TTGATGCTTT CTACAAGTGA ATATAGTCAG TCCCCAAAGA TGGAGAGCTT GAGTTCTCAC 120 

AGAATTGATG AAGATGGAGA AAACACACAG ATTGAGGATA CGGAACCCAT GTCTCCAGTT 180 

45 CTCAATTCTA AATTTGTTCC TGCTGAAAAT GATAGTATCC TGATGAATCC AGCACAGGAT 240 

GGTGAAGTAC AACTGAGTCA GAATGATGAC AAAACAAAGG GAGATGATAC AGACACCAGG 300 

GATGACATTA GTATTTTAGC CACTGGTTGC AAGGGCAGAG AAGAAACGGT AGCAGAAGAA 360 

50 

GTTTGTATTG ATCTCACTTG TGATTCGGGG AGTCAGGCAG TTCCGTCACC AGCTACTCGA 420 

TCTGAGGCAC TTTCTAGTGT GTTAGATCAG GAGGAAGCTA TGGAAATTAA AGAACACCAT 480 

55 CCAGAGGAGG GGTCTTCAGG GTCTGAGGTG GAAGAAATCC CTGAGACACC TTGTGAAAGT 540 

CAAGGAGAGG AACTCAAAGA AGAAAATATG GAGAGTGTTC CGTTGCACCT TTCTCTGACT 600 

GAAACTCAGT CCCAAGGGTT GTGTCTTCGG AGGCATCCAA AAAAAAAAAA AAA 653 

60 
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^ (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGCAGGCTGA CGACCTGCAA GCCACAGTGG CTGCCCTGTG CGTGCTGCGA GGTQGGGGAC 60 

15 

CCTGGGCAGG AAGCTGGCTG AGCCCCAAGA CCCCGGGGGC CATQGGCGGG GATCTGGTGC 120 

TTGGCCTGGG GGCCTTGAGA CGCCGAAAGC GCTTGCTGGA GCAGGAGAAG TCTCTRGCCG 180 

20 GCTGGGCACT GGTGCTGGCA SGARCTGGCA TTGGACTCAT GGTGCTGCAT GCAGAGATGC 240 

TGTGGTTCGG GGGGTGCTCG GCTGTCAATG CCACTGQGCA CCTTTCAGAC ACACTTTGGC 300 

TGATCCCCAT CACATTCCTG ACCATCGGCT ATGGTGACGT GGTGCCGQGC ACCATGTGGG 360 

25 

GCAAGATCGT YTGCCTGTGC ACTGGAGTCA TGGGTGTCTG CTGCACAGCC CTGCTGGTGG 420 

CCGTGGTCGC CCGGAAGCTG GAGTTTAACA AGGCAGAGAA GCACGTGCAC AACTTCATGA 480 

30 TGGATATCCA GTATACCAAA GAGATGAAGG AGTCCGCTGC CCGAGTGCTA CAAGAAGCCT 540 

GGATGTTCTA CAAACATACT CGCAGGAAGG AGTCTCATGC TGCCCGCANG CATCAGCGCA 600 

ANCTGCTGGC CGCCATCAAC GCGTTCCGCC AGGTGCGGCT GAAACACCGG AAGCTCCGGG 660 

35 

AACAAGTGAA CTCCATGGTG GACATCTCCA AGATGCACAT GATCCTGTAT GACCTGCAGC 720 

AGAATCTGAG CAGCTCACAC CGGGCCCTGG AGAAACAGAT TGACACGCTG GCGGGGAAGC 780 

40 TGGATGCCCT GACTGAGCTG CTTAGCACTG CCCTGGGGCC GAGGCAGCTT CCAGAACCCA 840 

GCCAGCAGTC CAAGTAGCTG GACCCACGAG GAGGAACCAG GCTACTTTCC CCAGTACTGA 900 

GGTGGTGGAC ATCGTCTCTG CCACTCCTGA CCCAGCCCTG AACAAAGCAC CTCAAGTGCA 960 

45 

AGGACCAAAG GQGGCCCTGG CTTGGAGTGG GTTGGCTTGC TGATGGCTGC TGGAGGGGAC 1020 

GCTGGCTAAA GTGGGKAGGC CTTGGCCCAC CTGAGGCCCC AGGTGGGAAC ATGGTCACCC 1080 

50 CCACTCTGCA TACCCTCATC 7VAAAACACTC TCACTATGCT GCTATGGACG ACCTCCAGCT 1140 

CTCAGTTACA AGTGCAGGCG ACTGGAGGCA GGACTCCTGG GTCCCTGGGA AAGAGGGTAC 1200 

TAGGGGCCCG GATCCAGGAT TCTGGGAGGC TTCAGTTACC GCTGGCCGAG CTGAAGAACT 1260 

55 

GGGTATGAGG CTGGGGCGGG GCTGGAGGTG GCGCCCCCTG GTGGGACAAC AAAGAGGACA 1320 

CCATTTTTCC AGAGCTGCAG AGAGCACCTG GTGQGGAGGA AGAAGTGTAA CTCACCAGCC 1380 

60 TCTGCTCTTA TCTTTGTAAT AAATGTTAAA GCCAGAAAAA AATAAAAAAA AAAAAAAAAA 1440 
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AACTCGAGGG GGGCCCRKAC CCAATCWCCC TATAGTAKAC GTANNN 1486 



5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2323 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEE^JESS : double 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CTTCGCCGTT TCTCCTGCCA GQGGAGGTCC CGGCTTCCCG TGGAGGCTCC GGACCAAGCC 60 

CCTTCAGCTT CTCCCTCCGG ATCGATGTGC TGCCGCCGCC GCCGCCGCCG TCCCGCGTCC 120 

20 

TTCGGTCTCT GCTCCCGGGA CCCGGCTCCG CGCAGCCAGC CAGCATGTCG GGGATCAAGA 180 

AGCAAAAGAC GGAGAACCAG CAGAAATCCA CCAATGTAGT CTATCAGGCC CACCATGTGA 240 

25 GCAGGAATAA GAGAGGGCAA GTGGTTQGAA CAAQGGGTGG GTTCCGAGGA TGTACCGTGT 300 

GGCTAACAGG TCTCTCTGGT GCTGGGAAAA ACAACGATAA GTTTTGCCCT GGAGGAGTAC 360 

TTGTCTCCCA TGCCATCCCT GTTAATTCCT GGATGGGGAC AATGTCCGTC ATGGCCTTAA 420 

30 

CAGAATCCCC CAGATGGCTT CATGGCCCCC AAAGCATGGA AGGTCCTGAC AGATTATTAC 480 

AQGTCCCTGC AGAAGAACTA AGCCTTTGGT CCAGAGTTTC TTTCTGAAGT GCTCTTTGAT 540 

35 TACCTTTTCT ATTTTTATGA TTAGATGCTT TGTATTAAAT TGCTTCTCAA TGATGCATTT 600 

TAATCTTTTA TAATGAAGTA AAAGTTGTGT CTATAATTAA AAAAATATAT ATATATATAC 660 

ACACACACAT ATACATACAA AGTCAAACTG AAGACCAAAT CTTAGCAGGT AAAAGCAATA 720 

40 

TTCTTATACA TTTCATAATA AAATTAGCTC TATGTATTTT CTACTGCACC TGAGCAGGCA 780 

GGTCCCAGAT TTCTTAAGGC TTTGTTTGAC CATGTC3TCTA GTTACTTGCT GAAAAGTGAA . 840 

45 TATATTTTCC AGCATGTCTT GACAACCTGT ACTCTTCCAA TGTCATTTAT CAGTTGTAAA 900 

ATATATCAGA TGTGTCCTCT TCTGTACAAT TGACAAAAAA AAAAATTTTT TTTTCTCACT 960 

CTAAAAGAGG TGTGGCTCAC ATCAAGATTC TTCCTGATAT TTTACCTCAT GCTGTACAAA 1020 

50 

GCCTTAATGT TGTAATCATA TCTTACGTGT TGAAGACCTG ACTQGAGAAA CAAAATGTGC 1080 

AATAACGTGA ATTTTATCTT AGAGATCTGT GCAGCCTATT TCTGTCACAA AAGTTATATT 1140 

55 GTCTAATAAG AGAAGTCTTA ATGGCCTCTG TGAATAATGT AACTCCAGTT ACACGGTGAC 1200 

TTTTAATAGC ATACAGTGAT TTGATGAAAG GACGTCAAAC AATGTGGCGA TGTCGTGGAA 1260 

AGTTATCTTT CCCGCTCTTT GCTGTGGTCA TTGTGTCTTG CAGAAAGGAT GGCCCTGATG 1320 

60 
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CAGCAGCAGC GCXTlGCTGTA ATAAAAAATA ATTCACACTA TCAGACTAGC AAGGCACTAG 1380 



AACTGGAAAA GACCACAGAA AACAAAGAAT CCAACCCTTT CATCTTACAG GTTGAACAAAC 1440 
5 TGTGATGATG CACATGTATG TGTTTTGTAA GCTGTGAGCA CCGTAACAAA ATGTAAATTT 1500 



GCXy^TTATTA GGAAGTGCTG GTGGCACTrGA AGAAGCACCC AGGCCACTTG ACTCCCAGTC 1560 



TQGTGCCCTG TCTACACCAG ACAACACAGG AGCTGGGTCA GATTCCCCTC AGCTGCTTAA 1620 

10 

CAAAGTTCCT CGAACAGAAA GTGCTTACAA AGCTGCCTTC TCGGATACTG AAAGGTCGAG 1680 



TTTTCTGAAC TGCACTGATT TTATTGCAGT TGAAAAAAAA AAAAAGCTAT TCCAAAGATT 1740 
15 TCAAGCTCTT CTGAGACATC TTCTGATQGC TTTACTTCCT GAGAGGCAAT CTITITTACIT 1800 



TATGCATAAT TCATTGTTGC CAAGGAATAA AGTGAAGAAA CAGCACCnT TAATATATAG 1860 



GTCTCTCTGG AAGAGACCTA AATTAGAAAG AGAAAACTGT GACAATTTTC ATATTCTCAT 1920 

20 

TCITAAAAAA CACTAATCTT AACTAACAAA AGTTCTTTTG AGAATAAGTT ACACACAATG .1980 



GCCACAGCAG TTTGTCTTTA ATAGTATAGT GCCTATACTC ATGTAATCGG TTACTCACTA 2040 
25 CTGCCTTTAA AAAAAAAAAC CAGCATATTT ATTGAAAACA TGAGACAGGA TTATAGTGCC 2100 



TTAACCGATA TATTTTGTGA CTTAAAAAAT ACATTTAA/A CTGCTCTTCT GCTCTAGTAC 2160 



CATGCTTAGT GCAAATGATT ATTTCTATGT ACAACTGATG CTTGrTCTTA TTTTAATAAA 2220 

30 

TTTATCAGAG TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 2323 

35 

(2) INFORMATION FOR SEQ ID NO: 25: 

40 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GGCACGAGCC TGTGTGGTCA TGTTCCTCGT GGTGCAGTAC CTGACATGAG CCAGCCACGC 60 

50 TCAGTGGCTG AACAGCATTC CCACAGCCTG CAAGTGTGTG TGTGTGTGAA AGAGAGAGGG 120 

GGGCCCAGAG CCGCCTTTTG AAATGTTTGC CTGTCTGAAC TGTGAAGACA CTTGGGAGTG 180 

ATTGTGGTCT AATTTCCAAC CTGCTCTGTT TTCTGTGACA TCTTGGAGGG GAGCTAGTGC 240 

55 

CACACCATGC GCGGTGCTTA GAAATGAAAA A3TCCCGGGT CTGTCTCTCT CACTCTOGCT 300 

CTCATGGGGG AGGGAAAGAA TQGdTTGGT GGCTTTGTTC ACACAGCTGA TGCGTGCTGG 360 

60 GAAGGTGTCC ACAGTGAGCC TGTGTGCAGG ACTGTCCACA CGGTTCACAC TTGTCACCAT 420 
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CAGGCCTTTC TQGTCCTGAT AGGGTGGAGC AAAAGTGGAA AQGAAAGGAA AGAGGCTTTT 
CTCACAGCCA TTATATTAAA TAGTAGGTCG ATTCACATCT CGTOCTCCTG GCCACCTTCC 
CCTGTGCCTC AGTGACATGT AGATGACTGA CTGCCAATAC TTGTCACCAT TCCCTQGAAG 
CAGCTACCTA GGGGAAACAA GATGTAGTGC TATTGCXX3AT AACAAGTAAG ATTTICCACA 
CTAAAAAAAA AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIXDGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTGAGAAAGG AAAGCATTCG GATCTGCTGC AAAAACACAT ATATCCATAA AGACTCATGT 
TATTCAGAAA ACAGATTGTG AACACAATCA CATTCGCATG AATCCTTTAA AAGGAAGAAG 
ACCTTAAAGT ATCTGCAAAT CTGAATTTCT ATTTATTCCT TCACTGAATA TAGAAACAAT 
GGTTATCTGA TTATTAGAGA TATTATTTTG GATATGTTAC TTATTAACTT GCTATGGCTG 
GTAACCATGA- TAAAGTCTGT TATTAATAAC AACATAATTC TTTTTTTAAA GAAGAAAAGC 
TTATTTTTCA TTGACAGTGT ATAGATTTAT CTACTTAGTT GTGTTTTGCT ATTAGTGTTT 
TAATTTTTTT TTTAAGTTGA GTGTTTGATA AATTTTAAGA CCCTGTCCCC ACCTTGTTTT 
GAGTCCTGTG TTGACTACAG GTATATAGCY CAWTTTAAAA ATCCTAAAGC AAAAGAATTT 
TATTTATAAA AGAATCMAMC MGTTGCATGC ATGAGGCTGT GAAGTCAGAT ATTTAGTAAT 
AAAAGCAGCA GTGCCTTTTT TTGTATTTAC CCATTGACCC CCACCAAATG CAACTGTTTT 
ATATTAAGAA AATAGTAACA ATTTTAAAAT CTCAGAGTAA AATCTATTTC ACTACATGCT 
TTTCCCCCCT TGTTCTGATT TAAGCAGTGT GTACTTGGCA TCTCTACATT GTCCTAGGGA 
CAGTGGTGTT CTACAATATT ATCATGTATG ATGnTTATT GGTGCTTTTT ATTCATAGTG 
GCTTCTTACC AGAAACAGTA GGAAGAAACA CATGTVACTGT GTACAAGACA TGAAACATTG 
CTGCTGATAT GTTGTTTTTT CACATGCTTT TGAGTTTTCA CTTTTTAAAC GAGAGCCAGC 
AAGCAAAATA GATGTGGCTG GGTCTGCCTG TCCGGGCGGC TYTTTGCACC GAGCTCTCAA 
ATCCTGTGTA TTGAGGGTTC CTTTTTQGTA CTCAGGATTG GAGCTACAGC TGGGCCCCCC 
TCTCTCCCAT TCGTTTGAAG AGACACTGAG GGAAACAAGG GTTTCTTTTG AGGTGTCCTT 
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GGCTGCCTTT TACGGGATGG GAGCCTTCTC 
GCTACTGCCG GTGCAAGGTT GTAGATGTTA 
5 CTGGGCTGAA TGCAAAAGCA TGCAACCAGA 
GCCTCATTGG TCCCCTGGAG ATGTCTGTAG 
AGCCTGACCA AGGCGCTCAG GTGTGCCTGT 

10 

CACATGGAGC TGTGATAACA CTAATGTTGA 
GTTTGCAAAG TGAGTTTTAT TTTTTTGTAA 
15 TATTCCTCTG GGAGGAATAG GAAGAAAACA 
CCTCCCTTAT TAATATATAA TCYTCATGTA 
AAGCTTTCTT TTC5TTGCATG CCCTGTGCAG 

20 

CCTGTTTTCC TGTATAAAGT TAGTGAACAA 
AAGCAATGCA TATTTTTTAA ATTTGTCATA 
25 AGCITTACTG ATATACAGAT ATACTAATGT 
TTTTCAAATG TTGTTACCAG TGAAACACCC 
TTATTCATTT CCTCCCATGT AACTAAGAAT 

30 

GAAAGTGAAG GGAAATGATT AATACAAGGT 



255 

CGGATCTTTT GTTCTTCTGC ACCTCTTGTA 1140 

TTCCCCAGGA GCCTGGGCTK GGGGGCTGAG 1200 

AGGCGGGCAA OGGGAGGAAA AGCAGGCXTTG 1260 

CAGTCAGCTC CAGCTTGC3GC CTGGGGAAGC 1320 

TACAAGAAGA ACCTGCAGAA GGATAATTTG 1380 

TTTTTTTTTT TTTTACAAGT CATCAGRGAT 1440 

TTCCTTTATC TTTACTTAAA GGTGAATGTG 1500 

GGAATGTTAA TAATGTCGAA CAGAAAACTT 1560 

TTTATGOCNT AATGTAAC3CT GACTTTTAAA 1620 

GCATCTGTAT TGTACATGCA TGCCTTTCGT 1680 

AGAAATATTT TTGCCXTTAGT TCATGTTGCX: 1740 

TATGGAAAGA GCATGTTTGT TACATGTAAA 1800 

TTGAAGATGC TGTTCTTTGC AAGTGTACAG 1860 

TTGTGGTTTA AACTTGCTAC AATGTATTTA 1920 

CATGGCTATA TTTCATATCA ACGTTATATT 1980 

TTTGTAACAA AAAAAAANAA ANNAAA 2036 



35 

<2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

45 

GGCACGAGAT AACATAQGCA CAATAATACT GTATGTCTAC TTCTAGGATT ATAAGGAATT 60 

AACATTGAGA TGACATOTCC ATTTGAGAAG AAAATAGTTG CTTTCAGTGC CTTTTATTTG 120 

50 ATTCCTGGAG AGAGCAGACT CGCACCAACA TTCAACCCCA GCGCTGATAT GACAGTAATC 180 

CTCAGAGGCA GAGCCCAGCA CAAAACAGCA ATGCTAGAAA GTTACAATTG GAAAGTTTCC 240 

TGCCAGCTTC GGGAATGACA CTGCAAAGCT GATGCCAGAA ACTGCCAGAG TAATTCTCCT 300 

55 

CATTACTGCT CTACCCACCC ACTTTCAGCT CCCCAAATTA ACTAGTGCAG TTGACTAATC 360 

CTCTTTACCT TTATCATTTA GGTGAGGCAT TGCACAAAAA CTCTCGACTT TGCCATATAA 420 

60 GGGCTGTGGT TCTCTGTGGT CCTGGATAAG AGGCATCACC ATTATCTGGA AACATGCAGT 480 
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AAATGCAGAT TCTTCATCTT CTCCCCAGAC CTCCTGAGTT AGAAATTCAC AAGTTCICCA 540 

GGTGATCTCA TACATGCTAA AGTTTQAGAA CCATTGAGTA AAGTTAATGC ATTAAGAAGA 600 

5 

GATTAGATAG GGATQGTQGC GTATCTTCCT ACAGTTTCCC TGTTAACAAG AAAGTCAGAG .660 

GTCAGTTGAT CAGACATTAG ATTATTTATT GCTAAAACTA AAAAAAATTA AAAAAAA 717 

10 



(2) INFORMATION FOR SEQ ID NO: 28: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GAATTCGGCA CGAGCAGCAT CCTAATTTTA GTTTGGAGAT GCATTCTAAA GGATCTTCTC 60 
25 TATTGCTTTT TCTCCCACAA TTAATdTGA TTCTGCCTGT CTGTGCACAT TTGCATGAGG 120 
AACTGAACTG TTGTTTTCAT AGGTAAATGA GAGACTGAGT TTTTTCATTT CTGAAGAGAA 180 
AGGGCATTTG CTCCTACAAG CTGAAAGGCA CCCCTGGGTG GCTGGGGCCC TCGTGGGAGT 240 

30 

TTCTGGGGGA TTGACCCTTA CAACATGCAG TGGCCCTACA GAAAAACCTG CAACTAAAAA 300 
TTATTTTTTA AAAAGGCTCC TCCAGGAAAT GCATATAAGG GCTAATCACC CAGTATITTG 360 
35 ARGCTTCGAA GARGTAATAR AMCCCTGGAG AGAGAAACTG AGACATGTAA GAGGGTGGGA 420 
ATGACTCAGT GGTGGCACAC TATGGAGTCC TGCCCACAAG TAGCACACAT CAACCCACTA 480 
CACAGAAATC CTAQG 495 

40 



(2) INFORMATION FOR SEQ ID NO: 29: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AGCTTAACGT CATGATTCAT TAGGGGAATG CAAGGCAAAA CCATGATGAG AATOCCCCTA 60 

55 

GACACCTCTT AGAAGAGCTG CTAGAAAGGC AGACAGCACC AAGCGCTTAA ATGAGATGGG 120 

GGCACTGGTG CTTCTTCTGT GCCTACTGGT AGGGGTGCAG CAGAGTGGTT CAGTCTGGGA 180 

60 CAGTTAGCTG GACATCACGT GGACCCAACA CACGCATTTC CTGGGTTACT TACCAAQGAG 240 
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AATAGAAAGC AGGCAGATCT TTACAGCAGC TCTTACCTGW TTGCAAAACA ATGGAAATGC 300 

^ CCACATGTCC ACAAACAAGT KTGTGGTCTG CXTOTGCCAT GAAGCACAGT GTCQCTGAGC 360 

GTCAAGAGTC CCCACACTCA AAGGAGGCAG CAGATACAGG GCTGCACACT GTGTGATTCC 420 

ACACATGTGA CATTCTGGAC ACGGACATGC TCGATGGCAA AACGAGCATC GGGCTGAGAG 480 

10 GACTGCTGAG AAGGGGAACG GGGCTGCTGG GATGTGGGTT GATTGTAGCA GTAGCTCAOG 540 

GAGATGTGAC CTCAAA 556 



15 



40 



45 



55 



(2) INFORMATION FOR SEQ ID KK): 30: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CTAAATGGTG ACTGTGGCTT TGTCGAGACA GGCCCCAAAT GGTAGGTGTG AACACAACAT 60 

GCACAGAATG AGGAGACATG CAGAGTGCTG AAATACTGTC CTGGACAGAT GTGTTACATG 120 

30 

ACTTTCTTTT CAGCTTATTT CTGTGGCCTG CCTTTGAAGA TAGAGCTTTG TTGATATTTA 180 

CATTAAACCA AATTGTATAA YTATGTTCCA TTCTGACATG TTATTTAGCA AARGAAAAAR 240 

35 GAGTAATTCT ACATCAGCAT CTTTAGTGCA TGCTAAAAGA TTAAAAATGT CTTTTGGGGA 300 

ACATGTTTTG TATACATAAA TGTTTAGATA GAAATATTTA TAGAATNCTC TATGTGAGTA 360 

TTNATCTCCC TATGTATATT TATATCTAGA TGTGTCAATC TTTGTATTGA TATGAAATGC 420 

TATGAATAGT GAGA 434 



(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 31; 

CCACGCGTCC GATCTCACAG CTCCGACACT ATTGCGAGCC ATACACAACC TGGTGTCAGG 60 

AAACGTACTC CCAAACTAAG CCCAAGATGC AAAGTTTGGT TCAATGGGGG TTAGACAGCT 120 

60 ATGACTATCT CCAAAATGCA CCTCCTGGAT TTTTTCOGAG ACTTGGTGTT ATTGGTTTTG 180 
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CTGGCCTTAT TGGACTCCTT TTGGCTAGAG GTTCAAAAAT AAAGAAGCTA GTGTATCCGC 240 

CTGC3TTTCAT GGGATTAGCT GCCTCCCTCT ATTATCCACA ACAAGCCATC GTGTTTGCCC 300 

5 

AGGTCAGTGG GGAGAGATTA TATGACTGGG GTTTACX5AGG ATATATAGTC ATAGAAGATT 360 

TGTGGAAGGA GAACTTTCAA AAGCCAGGAA ATC3TGAAGAA TTCACCTGGA ACTAAGTAGA 420 

10 AAACTCCATG CTCTGCCATC TTAATCAGTT ATAGGTAAAC ATTGGAACTC CATAGAATAA 480 

ATCAGTATTT CTACAGAAAA ATGGCATAGA AGTCAGTATT GAATGTATTA AATTGGCTTT 540 

CTTCOTCAGG AAAAACTAGA CCAGACCTCT GTTATCTTCT GTCAAATCAT CCTACAAGCA 600 

15 

AACTAACCTG GAATCCCTTC ACCTAGAGAT AATGTACAAG CCTTAGAACT CCTCATTCTC 660 

ATGTTGCTAT TTATGTACCT AATTAAAACC CAAGTTAAAA AAAAAAAAAA AAAAA 715 

20 

(2) INFORMATION FOR SEQ ID NO: 32: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOI/X3Y: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GAGCCAGTGC CGGCGAAAGG GGACCTTCCT CTACTTCCTG CCACAGACCC TGTCCCCACA 60 
35 CACTTCCTGC CCCTGCTCTG CTGGGAGGCC ACTTCCTCCC CCAGTGCTGG ATTCCACCCC 120 
CAGCTCACCC TCAAACATGG CCCCCTCTCT CCTCCTGCTT GCCCCTCTCT GCTCCCTGGA 180 
GGCTGTTCTG TCCTCCCCTC TTGAAAAGCA ATGCCAGCTT CCTGGGATCT TCTGCCAACT 240 

40 

CCAGCTACCA TGCCCTTTGC TCCTGTCAGC TCAGCTCCTC AAGGGAATTG TCTAMCCTCG 300 
GTGTCCTGCT TCCCTCCCTC AACCTCCTCA CCCTGCTCCA AGCTGGCATC TGCCCCTCCA 360 
45 CTGCACAGAA CGGNTCCCCC ACCACCTGCC TTTACAGGGA GGAAGCAGCA ACATGGAAGA 420 
ANCGAACTAT AGGGGCTACA ANGATGCTCA GCTCTGATCC CGAAGGCAAA AAGNATCTTT 480 
GGGCAC 486 

50 



(2) INFORMATION FOR SEQ ID NO: 33: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIMESS : double 
60 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TAATAATTAG GTTATTCCCA GAAGCACAGT GTCATTCTTT AAATAAAAGC 60 

TTTCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA AGATTCCCCC TAGGGTTGAT 120 

ATCTGTCTAA TTCATTTTAT AAAAATTATT CTTGTdTCA TTTTAAAGCT TTGGCTATAT 180 

10 AGTCAGAAAT GTCCTAAATA ACAAACTATT TTGTATTTAA TTTAGGGAAG ACTAAAGGGA 240 

AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT ATTAGGGCTT AAAGGGCTTT 300 

TCTAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT TCCTGTTTTT GATGAACAGA 360 

TCTCTGGGGA AATTGTTGAG TTACAATGGC ATTTCACTGT GATCCCTCTC AAGCTCAGAT 420 

CAGTTCTATA ACCCAATGAC AACCTGTCTC TTTGGTTrAC TGTCCTGTGA AATGTCAGCT 480 

20 CAAGTTTCCC AGAAGTCGTG TGTTTATGAT GAGTCAGAGT GCTTTTCCTC GGTGGGACAG 540 

TTGCTGGCCC TCTTAATTTT GGTGTATGTG CTTCCAAC3TA TCTAAACCTC CAGTCTGATC 600 

TGTATATGCT ATCCTAACTG TTAATTGTAT TATTGATTAT GTTGATTATC TTGCTTGAAG 660 

25 

GTTCATACTT TTCAATTTGA TAGAAATAAA GTTTTTTTCT GCTTATAAAA AAAAAAAAAA 720 

AAAAA 725 

30 

(2) INFORMATION FOR SEQ ID NO; 34: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CACACAGCAT GCTGCCCTCA GACGTGTCCA TCCTGTACCA CATGAAAACG CTGCTGCTCC 60 
45 TGCAAGATAC TGAGAGATTG AAGCATGCTC TGGAAATGTT CCCAGAACAT TGCACGATGC 120 
CTCCTGCTTT TATTGGCTCT TGTCGAAATC AAATTGGAAG ATCTTCAGTC CCAGCTGCAC . 180 

CCAACGTGGA AAAGTATTCC AGGTCCATCC CCAAGGAACC AACACCGATG ACATGGACTC 240 

50 

AGGAATCTTA TAACCTACGT GGACTCITTC CATCCGTACA TTGTCGTGCA CATGCCACTC 300 
ATCACCTGGC GTGCCCAGAT CCTCGCARGG CAACACCCTG TGATAATTCC AGGTGATTCT 360 
55 CTACATCTGC AGCTTGAGGT TAGCCTCATA TCACATTACA TTCTCACTAN AAACNAAAAA 420 
AAAAAAAAAA AACTCNA 437 

60 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : doiible 

(D) TOPdJOGV: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGCACGAGCT GGAACAGAGA CTAAATCCCA CGAAACTGAC ATTGTTAAAC ACACTAAAAC 60 

AGAAGTACTT ACCTCTTGAA GATTTAATAT ATAATGGTTG ACATGATACA TGTACATGAT 120 

15 

GAATGACCAG ATGCTTATGG TCTACATTTT CCTTTATCCT GTTAGTATTA CCTTCCTTAA 180 

TCTTTGTTCA TTAACATGCT AATTCCTCTT CAGTC3TTTAT TTTCTAGTGA CAGAATGCTA 240 

20 ACATTTCTTA CACCCTGGCA GAAGGGAGAG AAATGTGTTT TGGGGTGGGT AACTAAATTT 300 

TTGAGTGAAA TATCATAAGA TGANAATGGA AANAAGGAGA CACAAANAGT TATNACAAAA 360 

AAACAATGGT TTTTTTAGCC ATTTGACTGG CTdTTAAAT AGTCTACAAG ACATTCACGT 420 

25 

TTAACATCAC TTTTAGTGAA ATAAAATGTG CCATACTAGT ATGTGCTTCA AAAGGGCAAA 480 

TGTGCTTTAG TGCCCTA7VGG CTAAATTTTG GTCATTTGAC ATCAGAGATG TTGTAAGTAT 540 

30 TGCACTTAAT AOGCACCTAT TTNTCAATAG TGTTATnTT TGC3JTAGCAT TTTTriTACC 600 

ACTATNTTGT TGATAGCTTT TTGTTCTNTN AGGTTGNAAN ATGACAGTGC TNATtTOZAAA 660 

CAGATTACCC ATNTGCAGAA CTAAGGGAAG CNATTTATGT ATGAAACSiAA TTNTTGAATT 720 

35 

NGTCATTNTC AACCNTTGNA TTAAAGCTTA GACTAAATAG TAATATATNG TGGGNAGGAT 780 

TTTGGTTTTG TGATATTTNT GTGNATTAAG GNATAGATGT TAACCNTTAT TTTGTAGNAA 840 

40 AGTGANTTGT ATGTGGTTAA TTATAAATAA AACTGGTACC AGGNAAAAAA AAAAAAAAAN 900 

NAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 943 

45 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIjOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GGCACGAGAA ATCTTCATGC TGTAGTCACT CCAGACCATG GAGTGGCTTT CCAGCTGAAT 60 
GAATCCTATG TCTCGCGTGC AGGTGGTTGG TTTTCAATGT TCTTGCTAAT TTTTTTTCTA 120 

60 
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TTGGATCTTG GGAGTTTTCT TTGTTTGCTC CTGTGTTTGC CXZAGCTTTAA TAAAACCAGG 
CGCAAACAAA AACCATAGCA TTCTGAACAA TAGGGGGCCC ACATTGQACC CAGTATGTCA 
CTTTAATGGA CTTCAAGAAA AAATCTGAAT GGGAAAAATG ACACTAGGAA TCTATACTCC 
ACACATTTTA TGCCATATAA TGGTGTGTTT TCTTAATTTT tf i ' l ' l T J'r il ^ni; GCGAAATGTC 
GCrrrCAAAT TAAAATGACC TTTTCTTCTT TGAAACTTTT TGTITTGACT TGTATAATTA 
AGGGTTTGGA AAGATTCATA ATTCTGAGAG AGGTTTGCAA CXAGGAGATA CAAAGAAGTC 
TCAGTAGTAA TCTTGTTCAT GTGCmTAC AGCCAGCTAC ATTTAAQGAT GTA1TAGTTA 
CAGAAATTAT ATGTCTGTGT ATGTGTCTCT ACTCAATAAA GTACATGCCT CCACAAAAAA 
AAAA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GTGAGTGCCC GGGAGCCCCG AGGCCCTGCC CCTT^AGAAGG ATATCTYTRA CCGCTCCCTT 
GTCCACACCC TAACCCCCCA GCTGCTCAGG CAGTQGGCAC ATGGCAGGGG CCTCACTGGG 
GGCACATAGA GCATTTQGGG GACTGCGAGT GCTCACCTTT GACTTCCTGC AGGTCGGGGG 
AAAACCAGAT CATGATGACC AAAGTYTACA TATTCTTGAT CTTCATGGTG CTGATCCTGC 
CCTCCCTGGG TCTCACCAGG TATATGCCAC CACYTTCTGY TCTAAATTCA GAATAAGAGT 
CACATCAGGA GAGCACTGTC CCCAGGANAA TGCAAACGGG TTGGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 38: 
GTAGTCGTTG CGGTTGCCGG GATGGCGAAG ATCTCGCCGT TTGAAGTCGT AAAACGCACC 
TCGGTACCGG TGCTTGTTGG TTTGGTGATT GTWATCGTTG CTACAGAGCT GATGGTGCCA 
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GGAACXX3CAG CAGCGGTCAC AGGCAAGTAA 
TTTATCATGT CACCXZACTGT GGTATATGCG 
5 TTCAGCAATA ATCAGATGGC GGCTGGCGCA 
GCTGATGGTT CAGTGCCTGC GSCACCGTTT 
ACAGTTTTCG SACCGCGTTT AGCGTCAAGG 

10 

GTTCACOGCG AGCATAAGCA TTAAACATCT 
TACTTTCCAG CATATGTTTA CGCTGGCGGA 
15 AGGCAATGGG CTTAATGAGA TAATCAAATA 
CCATATCGCT GGCTGCAGTG GTAAACACCA 
CATGCAGTAA AT 

20 



ATACTAATGC CGGAGCAAGT OTCCTCCGGC 180 

TTGTGGTCTG CCAACTTTGC CGTGAACAAT 240 

ATATTCAAGA TAACGCCTGG CAGTGGTGCG 300 

YTGOCX3TATG TTGCACACCA GGNTCTTTAA 360 

GTTCAATCCC GGTOGGTAGC TCGTCCTTAG 420 

CATCAATTTG CTTCTQGCTG GCGCTATCAA 480 

AACGGGTTAG CGTTTGCCCC ARCMGWTCAT 540 

CACCACAACG TACGGCTTCA GACACCGTTT 600 

CGTCGCCGGG ATAATGCGCX: TGCACCAGTT 660 

672 



25 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 <D) TOPOLOGY: lineeir 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGAGTTGATA TTTTTAGAAA CAGTAATTTT ACTITTAAGG AAATTGGCTA GCTCTTTGAC 60 

35 

TNNAGAGCTG TAGGAAGCTC AACATTTCTT TGTAGAGAAC GTTGCTTTTT TTGGATTGTA 120 

CAGGTATAAA AACATTGCTT TTGTTGAATT GTATAGGTGT AAAAAGGGAA TAACTGTATG 180 

40 CAGGTTTGAA AAGGAAATGT GCTTTAGGCA TGAGTCATAA GATGCCATTG TACTTGTAGG 240 

CATTTTATTT TCCTTTAGAA ATGGACATCA GCTCTTCTCT TCTGACTGCT AACACATAGC 300 

CCCAAAGCAT GAGATTATTT TTCATTGGGT TTTTATTGTT GTTTAGTTTT GGTTTGTTAC 360 

45 

GCCAGCCCAG TCTGTCTGCG GAACACTGAC TCTGCTCTCT AATGAGAACA AAGTTAGAAA 420 

TCTGCCGATA ACCTAAAATA ATTTAGAAAT GAAOTAAAAA TGTGAAATCG GGTTAAAGTG 480 

50 ATGATGATAA AATAGCATGC AAGAAACAAG CTCCTTCCAT CAGACTTGGC TACTGTTTTC 540 

TTCTGGTACG ATTTGGTTTG GAAGAGCCTC TTGnTCCTT CTCTTTGGGG TATGTCTTCG 600 

TTTCTTAATA TGTTTGTAAC ATTATTGAGA TATAATTGAC ATACCTTACA ATTCACTTAT 660 

TTTAAGGGTA CAATTTAGTG GTTTTTAGTG TATTCACAAA GTTGTGTAAC CGTGACCACA 720 

GTCAATTTTA GAACATTTCG TTACCCCAAA AAGAAACCCT GTACCCTTGA GCAGTCACCT 780 

60 CTCATTTTCT CCCAGTGCCC ACCCCATCCC CGAGCCCCKG GAACCACTAA TCTATTTCTC 840 



55 
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TCTCTGTAGA TTTGCTTATT CTGGTCATTT CATATAAATG GAATTCTACA ATATTCGGTC 900 

TTTTQGGACT GGCTTCCCAA ATATGATTTT CTATATQGAG TGAGAAAATT CTTCTCATCT 960 

5 

TGAGAACTCT TATTGCTGTG AAAGGGAGTG GTTGGTAAAA TCAATAGATT TCAGGCAAGA 1020 

GGGCCAGATA CCTAACAGGT TTTTCTCCGT GAATCTTATG CTGAGTAGTT TTTCCTCATA 1080 

10 ACXIAAGCATT TATGATATAT TACTACTTAT AATACTGTGG CTAGTCTCTA GAATGGATGT . 1140 

TGAAATCTTT GCCTCCTCAG TCGGGAAGAG TCCTGCTAAA. AATCAGGCTA AAAATCAGGC 1200 

CAAAAATCAG GCCAAATGAC TTGGCAAATA ATTGACAAAG TGGTTTTCAC GTGTGTCTAT 1260 

15 

CTTTGCTAGC AGCTTGTATA CCTCAGGCCA GGTGAGCTCC CCAAATTTCT TTTTTCATTT 1320 

ACTCCAGTGA GTTTCTGCTG TCTTTTTCAA GTATGTACCA TAGGACTTAA AGGTGATTTG 1380 

20 GATGCGTTGT AACACTGCTA AATATGCTAA GTACAGAATT TTATCTACAG TACTGTGAGA 1440 

CAGTCAATTA TTGCCTAGGG TAGTTCAAAA ATATGATGTG AGCTAGTTAA GCCTTTGCTT 1500 

GACTGATTTC AGTGATATTC AGAAGTGTGT ACCAATCAAG GCTCTTTAAA ATACGGAACG 1560 

25 

ACTCACTTAA TAACCAGGGA ACCAGCXIAAA TACTGTGCAG CCGCAGAATA TGCATATCAA 1620 

TGAGTTGGAG GTGATTATTC TCTGTAACTC CCTAATGATT GTTTrCTAAG CATTGTGGCT 1680 

30 TCTCAGTGGC TTGACAGCAT CTTCCTGGTT GTATGTGGCC TGTTTACATG ATGTATTGAA 1740 

TAATGTTCTT TGTTGTGAGC ATCAATGCCT GTAACACCAA ACTAAACACG TGTTTTTGGG 1800 

ATATGTTTCC AATCTTTAAA TGACCTTGCC CTGTCCAATA AATAAATGAT TGTCTCACCC 1860 

TGTTAAAAAA AAAAAAAATT AAAAAAACTG GGNGGGGGGC CCGGTACN 1908 



35 



40 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 
45 (B). TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

50 

CCTCAAAAAA AAAAANGAAA GGAAAGAGGT CTCTACACAA GCCCGTGATT CTTCATGGCA 60 
AGGGATAACA TCAGAAATGT TTCATTTYCK GCTATTAGTT TCCATTCCTT TCCCCATCCA 120 
55 GGCATAAAGA GAAACAAAAG ACAATGATGG TATTCTCTGT GTCCTCAGCT TTGGCACTTT 180 
TGTTGATGTT GCTAAGGAGC AGTGACCTTG CTAAAAAGAC TGAATAATCC ACCCACTGAA 240 
TAGCTAACCT GGGGAGGAAA TGAAAATTTC CTTTGTGGAT CTCCCCAAAT CCATTGrTTOT 300 

60 
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CACCAGGCCC TCCCAGAACC TCCTCAGTTC CTTCACAGTG CAACCCTGTG TACTTGGCCC 360 
GCAT^CAAT AGTATTGTGC CTCACTTCAC CTTCCATGGG CAACTGCCCT CCCTTCTCGA 420 
5 CATAAAACCT CATATTTTAA ATNAAGTTGA AATTTGAA 458 

10 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: nucleic acid 
15 <C) STRAND(E£]MESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

20 GGCACAGAGC CTCCGACCCA GGTGGTCTGG AGCCTGCCGG GAGAGTGGTG GCATCTGAGA 60 

GGCTGGTCGT GGACTGTC3GT TGGGGGAGGT GGGAGCTGTT TTAACCGTGT GCCCCCTCTC 120 

CTGTGCCGGC GTGGGCATCC CCCGGGGCAG TGGAACCCGG GCGCTCCTCC AGCTTCCGAG 180 

25 

TCCAGCCAGC CTGGGCGCGG GGCGCGCCCC GAGACACCCG AGGAGTCCGT TCCTCCCTGG 240 

TTACGTGGAC TGTGGAGCTG GTCTCTTGTG GCTCAGCGCC GTGCGGAGGT TQAAGCGTAC 300 

30 CTGCGGAGGT CGCACCAGGG CGTGAGGAGG AGGAGGAAGG GCATGAGCCG AGCTTGAGGA 360 

ATCCGTGCTC CAAACTCTAC ACTCAAGGAT GCACTGCGCA ACTCTGGTGG CGATGGGCTG 420 

GGGCAGATGT CCTTGGAGTT CTACCAGAAG AAGAAGTCTC GCTGGCCATT CTCAGACGAG 480 

35 

TGCATCCCAT GGGAAGTGTG GACGGTCAAG GTGCATGTGG TAGCCCTGGC CACGGAGCAG 540 

GAGCGGCAGA TCTGCCGGGA GAAGGTGGGT GAGAAACTCT GCGAGAAGAT CATCAACATC 600 

40 GTGGAGGTQA TGAATCGGCA TGACTTACTTG CCCAAGATGC CCACACAGTC GGAGGTGGAT 660 

AACGTGTTTG ACACAGGCTT GCGGGACGTG CAGCCCTACC TGTACAAGAT CTCCTTCCAG 720 

ATCACTGATG CCCTGGGCAC CTCAGTCACC ACCACCATGC GCAGGCTCAT CAAAGACACC 780 

45 

CTGCCCTCTG AGCGTCGCTG GATCTCTGGG AGCTCCTTGA TGGCTCCCAG ACCTTGGCTT 840 

TTGGGAATTG CACTTTTGGG CCTTTGGGCT CTGGAACCTG CTCTGGGTCA TTGGTGAGAC 900 

50 TTGGAAGGGG CAGCCCCCGC TGGCTTCTTG GTTTTGTGGT TCXXIAGCCTC AGGTCATCCT 960 

TTTAATCTTT GCTGACGGTT CAGTCCTGCC TCTACTGTCT CTCCATAGCC CTGGTGGGGT 1020 

CCCCCTTCTT TCTCCACTGT ACAGAAGAGC CACCACTGGG ATGGGGAATA AAGTTGAGAA 1080 

55 

CATGAGTTTG GGCTGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AAA 1153 

60 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LEIOGTH: 19B3 base pairs 
<B) TYPE: nucleic acid 

(C) SmANDEENESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GGCACGAGAG GGGCCGAGCC GACAAGATGT TCTTGCTGCC TCTTCCGGCT GCGGGGCGAG 60 

15 TAGTCGTCCG ADGTCTGGCC GTGAGACGTT TCGGGAGCCG GAGTCTCTCC ACCGCAGACA ' 120 

TGACGAAGGG CCTTGnTTA GGAATCTATT CCAAAGAAAA AGAAGATGAT GTGCCACAGT 180 

TCACAAGTGC AGGAGAGAAT TTTGATAAAT TGTTAGCTGG AAAGCTGAGA GAGACTTTGA 240 

20 

ACATATCTGG ACCACCTCTG AAGGCAGGGA AGACTCGAAC CTTTTATGGT CTGCATCAGG 300 

ACTTCCCCAG CGTGGTGCTA GTTGGCCTCG GCAAAAAGGC AGCTGGAATC GACGAACAGG 360 

25 AAAACTGGCA TGAAGGCAAA GAAAACATCA GAGCTGCTGT TGCAGCGGGG TGCAGGCAGA 420 

TTCAAGACCT GGAGCTCTCG TCTGTGGARG TGGATCCCTG TGGAGACGCT CAGGCTQCTG 480 

CGGAGGGAGC GGTGCTTGGT CTCTATGAAT ACGATGACCT AAAGCAAAAA AAGAAGATGG 540 

30 

CTGTGTCGGC AAAGCTCTAT GGAAGTGGGG ATCAGGAGGC CTGGCAGAAA GGAGTCCTGT 600 

TTGCTTCTGG GCAGAACTTG GCACGCCAAT TGATGGAGAC GCCAGCCAAT GAGATGACGC 660 

35 CAACCAGATT TGCCGAAATT ATTGAGAAGA ATCTCAAAAG TGCTAGTAGT AAAACCGAGG 720 

TCCATATCAG ACCCAAGTCT TGGATTGAGG AACAGGCAAT QGGATCATTC CTCAGTGTGG 780 

CCAAAGGATC TGACGAGCCC CCAGTCTTCT TGGAAATTCA CTACAAAGGC AGCCCCAATG 840 

40 

CAAACGAACC ACCCCTGGTG TTTGTTGGGA AAGGAATTAC CTTTGACAGT GGTQGTATCT 900 

CCATCAAGGC TTCTGCAAAT ATGGACCTCA TGAGGGCTGA CATGGGAGGA GCTGCAACTA 960 

45 TATGCTCAGC CATCGTGTCT GCTGCAAAGC TTAATTTGCC CATTAATATT ATAGGTCTQG 1020 

CCCCTCTTTG TGAAAATATG CCCAGCGGCA AGGCCAACAA GCCGGGGGAT GTTGTTAGAG 1080 

CCAAAAACGG GAAGACCATC CAGGTTGATA ACACTGATGC TGAGGGGAGG CTCATACTGG 1140 

50 

CTGATGCGCT CTGTTAiCGCA CACACGTTTA ACCCGAAGOT CATCCTCAAT GCCGCCACCT 1200 

TAACAGGTGC CATQGATGTA GCTTTGGGAT CAGGTGCCAC TGGGGTCTTT ACCAATTCAT 1260 

55 CCTGGCTCTG GAACAAACTC TTCGAGGCCA GCATTGAAAC AGGGGACCGT GTCTGGAGGA 1320 

TGCCTCTCTT CGAACATTAT ACAAGACAGG TTGTAGATTG CCAQCTTGCT GATGTTAACA 1380 

ACATTGGAAA ATACAGATCT GCAGGAGCAT GTACAGCTGC AGCATTCCTG AAAGAATTCG 1440 

60 
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TAACTCATCC TAAGTGGGCA CATTTAGACA TAGCAGGCGT GATGACCAAC AAAGATGAAG 1500 

TTCCCTATCT ACGGAAAGGC ATGACTGGGA GGCCXyVCAAG GACTCTCATT GAGTTCTTAC 1560 

TTCGTTTCAG TCAAGACAAT GCTTAGTTCA GATACTCAAA AATGTCTTCA CTCTGTCTTA 1620 

AATTQGACAG TTGAACTTAA AAGGTTTTTG AATAAATGGA TGAAAATCTT TTAACGGAGA 1680 

CAAAGGATGG TATTTAAAAA TGTAGAACAC AATGAAATTT GTATGCCTTG ATTTTTTTTT 1740 

CATTTCACAC AAAGATTTAT AAAGGTAAAG TTAATATCTT ACTTGATAAG GATTTTTAAG 1800 

ATACTCTATA AATGATTAAA ATTTTTAGAA CTTCCTAATC ACTTTTCAGA GTATATGTTT 1860 
15 TTCATTGAGA AGCAAAATTG TAACTCAGAT TTGTGATGCT AGGAACATGA GCAAACTGAA " 1920 
AATTACTATG CACTTGTCAG AAACAATAAA TGCAACTTGT TGTGCAAAAA AAAAAAAAAA . 1980 

AAA 1983 

20 



(2) INFORMATION FOR SEQ ID NO: 43: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) USNGmi: 1406 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
30 (D) TOPOUX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATGATGATGA CTITGAAGAC GATTTTATTC CTCTTCCTCC AGCTAAGCGC CTTGAGGTTA 60 

35 

ATAGTTGGAA AAGACTCTAT AGATATTGAC ATTTCTTCAA GGAGAAGAGA AGATCAGTCT 120 

TTAAGGCTTA ATGCCTAAGC NCTTGGTCTT AACTTGACCT GGGATAACTA CTTTAAAGAA 180 

40 ATAAAAAATT CCAGTCAATT ATTCCTCAAC TGAAAGTTTA GTGQCAGCAC TTCTATTGTC 240 

CCTTCACTTA TCAGCATACT ATTGTAGAAA GTGTACAGCA TACTGACTCA ATTCTTAAGT 300 

CTGATTTGTG CAAATTTTTA TCGTACTTTT TAAATAGCCT TCTTACGTGC AATTCTGAGT 360 

45 

TAGAGGTAAA GCCCTGTTGT AAAATAAAGG CTCAAGCAAA ATTGTACACT GATAGCAACT 420 

TTCCACACAG GACGTTGAAA ACAGTAATGT GGCTACACAG mTTTTAAC TGTAAGAGCA 480 

50 TCAGCTGGCT CTTTAATATA TGACTAAACA ATAATTTAAA ACAAATCATA GTAGCAGCAT 540 

ATTAAGGGTT TCTAGTATGC TAATATCACC AGCAATGATC TTTGGCTTTT TGATTTATTT 600 

GCTAGATGTT TCCCCCTTGG AGTTTTGTCA GTTTCACACT GTTTGCTGGC CCAGGTGTAC 660 

55 

TGTTTGTQGC CTTTGTTAAT ATCGCAAACC ATTGGTTGGG AGTCAGATTG GTTTCTTAAA 720 

AAAAAAAAAA AAAACGACAT ACGTGACAGC TCACTTTTCA GTTCATTATA TGTACCGAGG 780 

60 GTAGCAGTGT GTGGGATGAG GTTCGATACA GNCGTATTTA TTGCTTGTCA TGTAAATTAA 840 



I 
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AAACCTTGTA TTTAACTCTT TTCAATCCTT 
TGGTGCTTAT TTTTTCAAAA ATTTGCTCTG 

5 

CTAATGAACT ACAGCTATCT TAATTTGGTT 
CTACAAC3GAA TATTAAAAAA ATCTATTCAC 
10 CATX3AGTCAC AGCTTTTGTT CTGTGGTAAC 
TGTAAAGAAC TGAAAACAAT GTATATGTTG 
TCATAAGAAA TTAAAAGAAC TTACCAGGAA 

15 

TAAAATAGGA AATTATAAAT ATATAGTTTT 
TAGTTATGTA GTTATTATGN AACCACCAAG 
20 GGCCGATTAA NTGGTTTGAA GNCTTG 
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267 

TTAGATAAAA TTGTTCTTTG CAAGAATGAT 900 
AACAACGTGA TGACAACAAG CAACATTTAT 960 
CTTCAAGTTT TCTGKTGCAC TTGTAAAATG 1020 
TTTAACTTAT AATAGTTTAT GAAATAAAAA 1080 
CTATAAAAAA AGTTTGTCTT TGAGATTCAA 1140 
TAAATATTTG TGTGTTGTCA GAAATTTTTC 1200 
GGTTTTTAAG TTAGAAATAT TCCATGCCAA 1260 
AAGCCTGCAT CAGTGGGAGT CTTGGCTATG 1320 
ATTTTTTTGG CTATTTACCG TAACCAAAGG 1380 

1406 



25 (2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) I^ENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
35 GGGCCTGAAG GCGGCRCGCC AGTCCCGAGC AGTGCTCGCT CCTGCTCGGG GCGCTGCGGC 60 



CCCGGGCGTC GCCATGACCA GTGAGCTGGA CATCTTCGTG GGGAACACGA CCCTTATCGA 120 



CGAGGACGTG TATCGCCTCT GGCTCGATGG TTACTCGGTG ACCGACGCGG TGGCCCTGCG 180 

40 

GGTGCGCTCG GGAATCCTGG AGCAGACTGG CGCCACGGCA GCGGTGCTGC AGAGCGACAC 240 



CATGGACCAT TACCGCACCT TCCACATGCT CGAGCGGCTG CTGCATGCGC CGCCCAAGCT 300 



45 ACTGCACCAG CTCATCTTCC AGATTCCGCC CTCCCGGCAG GCACTACTCA TCGAGAGGTA 360 



CTATGCCTTT GATGAGGCCT TTGTTCGGGA GGTGCTGGGC AAGAAGCTGT CCAAAGGCAC 420 



CAAGAAAGAC CTGGATGACA TCAGCACCAA AACAGGCATC ACCCTCAAGA GCTGCCGGAG 480 

50 

ACAGTTTGAC AACTTTAAAC GGGTCTTCAA GGTQGTAGAG GAAATGCGGG GCTCCCTGGT 540 



GGACAATATT CAGCAACACT TCCTCCTCTC TGACCGGTTG QCCAGGGACT ATGCAGCCAT 600 



55 CGTCTTCTTT GCTAACAACC GCTTTGAGAC AGGGAAGAAA AAACTGCAGT ATCTGAGCTT 660 



CGGTGACTTT GCCTTCTGCG CTGAGCTCAT GATCCAAAAC TGGACCCTTG GACCCGTCGA 720 



CTCACAGATG GATGACATGG ACATGGACTT AGACAGGAAT TTCTCCAGGA CTTGAAGGAG 780 

60 
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CTCAAGGTGC TAGTGGCTGA CAAGGACCTT 
GCTCTCCX3GG AAAGCTGGGC GTCTTCTCTG 
5 GGGGGCTGGT GAACGTGCCG CXAAGCTGAC 
GGACCTCGTG GAGAAGTTTG TGGAACCCTG 
GCGGTTCTTC CTGAATCAGT ATTCAGCGTC 

10 

GOCXrCTCTGG GACCXXTTACA TGGGCACCCT 
CTGAGGTGCC TCXXAACGTC CGCCCACGCT 
15 GGTCCTCGCT CCX5GGGAGCA AGTGGQGGGC 
ACCTGGTGTC CGTOTACAAG GATGGATGTG 
CTCAGGGAAT GGTGTCTGTG CTCAGCCCAT 

20 

AAAAAAAAAA A 



268 

ctggacctgc acaagagcct gc5tgtgcact 840 

agatggaagc caacttcaag aacctgtccc 900 

cx:acaataaa gatgtx::agag acctgtttgt 960 

cxxxrrccgac cactggccac tcagcgacgt 1020 

tgtccaatcx: ctcgatggct tccgacacca 1080 

ccgcggctgc ctcctgcgcc tgtatcatga 1140 

gacaataaag ttgctctgag tttggagact 1200 

gtgcagatgt gcctgtgtct gtctctgagc ' 1260 

tocnc3tggct ccttgggaac tgagacatat. . 1320 

cx:accagaag AcrrcrGcrcA caaaaaaaaa 1380 

1391 



25 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotible 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

35 

GGCACGAGTG GAGATGGCTG CGGCCGTGGC GGGGATGCTG CGAGGGGGTC TCCTGCCCCA 60 

GGCGGGCCGG CTGCCTACCC TCCAGACTGT CCGCTATGGC TCCAAGGCTG TTACCCGCCA 120 

40 CCGTCGTGTG ATGCACTTTC AGCGGCAGAA GCTGATGGCT GTGACTGAAT ATATCCCCCC 180 

GAAACCAGCC ATCCACCCAT CATGCCTGCC ATCTCCTCCC AGCCCCCCAC AGGAGGAGAT 240 

AGGCCTCATC AGGCTTCTCC GCCGGGAGAT AGCAGCAGTT TTCCAGGACA ACCGAATGAT 300 

45 

AGCCGTCTGC CAGAATGTGG CTCTGAGTGC AGAGGACAAG CTTCTTATTG CGACACCAGC 360 

TGCGGAAACA CAAGATCCTG ATGAAGGTCT TCCCCAACCA GGTCCTGAAA GCCCTTCCTG 420 

50 GAGGATTCCA AGTACCAAAA TCTGCTGCCC CTTTTTGTGG GGCACAACAT GCTGCTGGTC 480 

AGTGAAGAGC CCAAGGTCAA GGAGATGGTA CGGATCTTAA GGGACTGTGC CATTCCTGCC 540 

GCTGCTAGGT GGCTGCATTG ATGACACCAT CCTCAGCAGG CAGGGCTTTA TCAACTACTC 600 

55 

CAAGCTCCCC AGCCTGCCCC TGGTGCAGGG GGAGCTTGTA GGAGGCCTCA CCTGCCTCAC 660 

AGCCCAGACC CACTCCCTGC TCCAGCACCA GCCCCTCCAG CTGACCACCC TGTTGGACCA 720 

60 GTACATCAGA GAGCAACGCG AGRAAGGATT CTGTCATGTC GGCCAATGGG AAGCCAGATC 780 
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10 



15 



20 



25 



30 



CTGACACTGT TCCGGACTCG TAGCX:AGCCT GTTTAGCXAG CXXTTGCGCAT AAATACACTC 840 
TGCGTTATTG GCTCTGCTCT CCTCAATGGG ACATGTGGAA GAACTTGGGG TCGGQGA£?ro 900 
TGTTTCTCAC TTGGTTTrCA CTAGTAATGA TATTGTCAGG TATAGGGCCA CTTOGAGATX3 960 

CAGAGGATTC CATTTCAGAT GTCAGTCACC GGCTTCGTCC TTAGTITrCC CAACTTGGGA 1020 

CGTGATAGGA GCAAAGTCTC TCCATTCTCC AGGTCCAAGG CAGAGATCCT GAAAAGATAG 1080 

GGCTATTGTC CCCTGCCTCC TTGGTCACTG CCTCTTGCTG CACGGGCTCC TGAGCXCACC 1140 

CCCTTGGGGC ACAACCTGCC ACTGCXZACAG TAGCTCAACC AAGCAGTTGT GCTCAGAATC 1200 

GCACCTGGTG AGAGCCTGCT GTGTGCCAGG CTTTGTGCTC AGTGCTCTTA CATGTATTAG 1260 

TTCCTTTACT GCTGACCACA TTGTACCCAT TICACAGAGA AGGAGCAGAG AAATTAAGTC 1320 

GCTTGCTCAA GGTCATGCAG TTAGTAAGTG GCAGAACAGG GACTTCAACC AAGCXXTTCTC 1380 

CTCTGAAGAC CGCGTCCTGA ATTTCTTCAC TAGAGCTTCC TCATCAGGTT ACCCAGAAGT 1440 

GGGTCCCATC CACCATCCAG GTGTGCTTGG ATGTTAGTTC TCCACCCTCG AGGTCTACGC 1500 

TGTGAAAAGT TTGGGAGCAC TGCTTTATAA TAAAATGAAA TATATTCTAA AAAAAAAAAA 1560 

AAAAAAAAA ^5^9 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: lineax 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 

GGGCCCcccc wcGWKTTTTT ' iu ' rrrm ' rr tttaattagg 

AATGAAACGT TCATTCCTCC TTCCACTCCT TCTCGTTGGT 
GATCCTGCTA QAAACGTTGT CAGTCTGCTT GTGGCTTCCC 
TGTGATGTCT TGAGAAGTAT CTATCCACTT CATGTGAATG 
CATCAATCAT TCTTACCTAA AGAATAATAA GAAAAAGTTA 
AAATAAAGGT TTGAAAATGC TAGTCAACTT CAAAATTTAA 
AAGATTGGGG GTAAGTTACA GCATAAAAAA ATAGGAAGAA 
TCTAAAATTA TTCTTACATA AAATAAGTAG ACACCTGAAT 
CTTTAAAATG TAAAAGCCTG ACTCTCAGTT TCACCAGTCT 



46: 

ATAATGCCTT TATTAACGAG 60 

TTTCTGGACA CAGCTCACCT 120 

TCCTTGATTG ACTCACGCTG 180 

AGCACTCCAA TATCAGCCAA 240 

ATATAAAAGA CAAGGGTATA 300 

AGAGTAAAAA TCCAGAGATA 360 

ACTTCATGGT GGGGGGGAAA 420 

TAGAATGAAA ACTGTATTTT 480 

GAGCACAAGT TTGACTGCAA 540 
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CCXy^AAATAT ACTATCCCTT ATGTGAAGGT ATGTGACAAC GTTGACCTCA CCAAATGAGT 600 

TTTAACATCA GCTCTTTTTT. CATATGAAAG CACATACCCT GCTCCCCATT CAAGTATGTC 660 

5 TTCCATTGTC AGGCAGGCTG AOCACCITCA GCAGGAGTCC TCCAAGAGTG CXTCAACTCCC 720 

CTTCCCACAG TACACAACGC TGTAGTTCTT GTCCTGCAAT CCTTTOTATT TACCTCATTC 780 

TTTCCCATCT AAGTCCTCAC TGAGTTTTAA AGTTAGGGCT GGAAAAGCTA TGCCTTACTG 840 

10 

GGACAGCAAG GAACCAATTT TTTTCTGAGG GAGAAGACAT TCACCTTCAC TATATGCCTC 900 

GCAGGGCXrAC AGTGCACAAA ACAAAGATCA GCCTTCATTC AAGTTCCAGG TTTTrCTTCC 960 

15 TCCCTGAATG ATTACTGCAA AGGGTATATG AAGTAAGAGT TCCCTGTTGC ACATCTACCA 1020 

TCCATAAGGG ATACTATATC GTTTTGCATT OTTCCCCCCA TTCTCCACAT TGTCCTATCT 1080 

TAAGTCCAAG CCCTTTTCAC TCTCAAAAAA AAAAAAAAAA TATTTTTTTC AGCACTGGTG 1140 

20 

TTCAAAAGCA ACGTTTTTAT GGTTAATGGT TTACX:AGCAA CTGTTGAGAT TTCCAGTrGA 1200 

GTCTTAAAAA TTGCCAATCA TTATCTAGCA GCAATGACAG ATGATTAGGA GCAGTCAAAT 1260 

25 CCTCTGAATT CTTTCCCTAA TAGGCAGCCA TTTGAGAACT GCACTAGCTG ACATCACTAA 1320 

AACATTATCA GCTAAAGCCA AAACCAAATA AAGGCCX:aGA CCAACATCCT GGCTCTCTAA 1380 

AACCTGTCCA AAATCATTAA GTGAAAGGCA GTAAATGCAG GACTGTGGAT CATX3TCACTG 1440 

30 

CAGCTGACAA TGATTAACAA TAGGAGACAT GGAACXTCCCA TTAAGGTTAA AAGTCCAAAA 1500 

CTAGTCACAC GCATCTCTTT ATTGGGGAAA AGTGAGACTA TTATGCATTC TTGGTAGGTT 1560 

35 TGCAACCTTG CATGAAGAGC ACCCATTGCA TTTCTTTCAT CTTTCAGAAA GCACCGGTAT 1620 

CTGTTCCAAG GGCCTAACAG TACGAAAATA CATTCTGGCA TCACACCTCT GAACCCAAGA 1680 

CTGTTCTCAT TAAAAATAAT TTTGGTTTGT AACAAAATTA TGAAATACAA TGCAAGCACC 1740 

40 

TCGGTATAGC ATTATTACTG AAACCACTTA ATTCCCAGCT TTTTQAGTTT TTTAAAAAAA 1800 

CCCACTGCAC TAAGATTCAC AATTCATTGC TACATACAAA TTAAAGCTAG TAAGAACACA 1860 

45 CTAACGTCAC AAGTTTCTCA TTCTAAAGTG CAAAAGCCTA ATCATCTGAA AGTGAACAGG 1920 

GTAA 1924 

50 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOIjOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 47: 
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10 



15 



20 



30 



40 



45 



50 



55 



60 



TGGTGTX3GGG CCCAGAAAMC AAGGGACCAG TGAAAACAMC CCCAGAGACT TCTATCCGCC 

AGGAAAGCCA TTGCCAMTYC TGAGCCCTTG AAGGGCAAGG AGGGAAACAG TCTTACCAGA 120 

GCCCAGTAAG AACTGCTGTC ATGAAGGAGG GGCCACCTTG TAAGAGACAT CATTACTACC 180 

AGAACTGTGG TGCCAAATTG CTGGTGTCTC TCTTTGGAGA AACCAACCAG ATACATCTOC 240 

TGGAGACCCA GGTGGGCACA GAGAAGGGTG GAGAGAGAAT CTCGGAAGAG AAATCGAGAA 300 

TAAGCAGCAC AGTGTTATTC ATTTCTCTAA ATTCCTATGT AGAAGGCTCA CTGTTAGAAA 360 

TAAAGTTATT CTACTAGTTG CAAGTTAAGT GTTTCTGTTT GTTCTCCTrr CCTGITAGCA 420 

TAAGTAAACT CCCTirGGAA CTACACAGGT ATGTCTCTCC TTCAACATCT GTGAA 475 

<2) INFORMATION FOR SEQ ID NO; 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

AAGC3GACAGA QACCTGGATT CAGATCTCAT TTTACAATGA AGACCCCAAT GCAGAAAGTC 60 

ATGTCTGAAA TTCTGAGCTT ACTCTTCTGC CTGCTGGGAC CTCCTCTGGA TGAGAGAAGG 120 

35 GAGGAAAAGG ACTAATCAGA GGAGCCAATG AAGTCACTCC ATGAGTTTCC TGAACCCTCC 180 

CCAGCTAGAG ATTAACGTYT GACCWTCAAC GTAGGACACT GTCCAGATOG CTACITOGTC 240 

GCGCACATGA AGACCAAAGC CAGGACCAAG CCCCMASCCT GCTWAACACG GCAGARTCTT 300 

GCCCAGCCMA CYTCTGTGAR AATCTGCTTC CCTCCACAGC TCACCC 346 



<2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEEa^ESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

TAGGTGTCAG CCGCCACCCC CCCCCCATAT GCAGATTTAC TSGGCATCGT AGTOGCCAGC 60 

TTCTAACACA GCTGGTATTT CAAGTCTCCT GGGACCTCAC TCAGGAATGA TACCCCCTCA 120 

GTAGAAGCAG CAGGTGATCT TAACTCCTTT CAAAGAGCAG GCCTGTCTCG GAAGCCATCT 180 
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CCTCAGCAGG CACAGCAACC CCTCTGGAAA TGGATCACAA ACTCACTTCT CAGCXiAGGCA 240 

GGCCAAGCTT CTATTGTAAC AGTAGGCACA GTATAGTCGG ATCATCACAT CPSXTGOGTV 300 

5 

TTTGGTTTAG TCATCTAGAG TCGTCTGGAC TAAAGGTCTT TCAGGTCTCC TTGCCCTGTG 360 

AGTGCGTGAA CCTCCCCACC CX3AATTGCCT CAGTTGTCCT GAGCCTCATG TCTCTCCTGG 420 

10 TGGTGGGCCA GGCCCCTGCA TGGGAAGGGA GCCTOCTGCG GGGCAGGCCA GCTGGGGGTC 480 

CTCACCTATG CGCAATGANA GTTATTGAAG GACTGGTTCT TGATGTTGGT GAGCGTATCC 540 

TTCATGGCCA GCGCXSAAGTC GGCCAGGTCA GCCAGGTGCT GCCAGCGCTC TCTCTCGGAC 600 

15 

TTGTCTTCCT GTGCCAGGGG ACCGTGGAGA AAGTGTCAGG CGCCGCrrCAC TGCAGCAGCC 660 

TGCTCTGCTG CCTTCCCTGG CAGTGTTCTG GGGGTGGATT CXTTACAMCT AGATGTTCAA 720 

20 GGCCTTACTT TTCCTCCCAC AAAQGAGTCG CAGCCACGCT AGCTCTGACT TGCCACTGTG 780 

ACAAAGTTCA CGTAGCAGGT CTAGGCAAAG ACTGGGCAAT TGAGCAGAGG AGACGGACCT 840 

GTGAGTCTGA CCRYGAGSCG GRCCCCTTCA CXnTGGCTGG GCTGGTCCTG GTCCTTAGGT 900 

25 

TTTGTCAGGT TGTCCTTGTT TGGATCCCTC AACTAGGTGA TAAGCACTGG AGGGGGATGA 960 

CCCGCCTTGG ACGTGTTTCT TTAACCTCAT CCATATAATA GGGCCC5TGGG ATGGTTGTAG 1020 

30 AGGTAAAGCA GGATGAOGGT GTTOTAAGAC CAGAGCTTGG GACCAGGGCT CCTACACXTTA 1080 

ATTTTCTCTC CTGGTAGCTG AACAAAGGTC TAAATTAGCT TAACAAAAGA ACAGGCTGCC 1140 

GTCAGCCAGA GTTCTGAAGG CCATGCTTTC AGTTTCCCTT GTTGACAATT GCTCTCCAGT 1200 

35 

TCCTATGAAA GCACAGAGCC TTAGGGGGCC TGGCCACAGA ACACAACCAT CTTAGGCCTG 1260 

AGCTCTGAAC AGCAGGGGGT TGTGTGTCTG TTCTGTTrCT CTGCTTGCCG AACTTTCTCA 1320 

40 ATAAACCCTA TTTCTTATTT ATAAAAAAAA AAAAAAAAAA AAAAAA 1366 



45 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



55 GCAGTAATTC CTGTTAGCCA CTGCATCCAC CAAAACTAGT TTArTTTTCC CCTCAAATTC 60 

ATGATTTTTA CGTCTGTTAC AAAGGGAATT TTGCTGATAG CTCTTTGGGT CCCACTGTTC 120 

CATTTTATGC TAATAGATTC CATTCTAGGG CCCAGCCGTC TCTTGACTGA TGGTGTTCCC 180 

60 
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TTTAACCCTT GGCATGTATA ATAGAATTTT GGTGAATGAA AGAACCCAAA TAGGCCAGAT 240 

AGTCCCCCCA GGCCCTGATA TCCATAAAAG GCTTCGGAAT GCATTATGTA ATTGTCCTTA 300 

GTCTTTTTGT TGTTTTAGAA AAAAAAAACA AGATGGGCTC AGATGGATGC CTACGTAAAA 360 

ATGGTTCCTA GCTGTGTACT CATAACTTTT CTTTGAATTG AGTAGTGAAA GGAAGGAGGA 420 

GGAAAGGAAA TTAAATGTCC TTCTAGTATT CTTCTCGACTC AAGTCTGACA TATGAGATAA 480 

TAACCTATAT TGAAATGCCA AGAATTGTAT CTGAAACAAG AGAAGAGTTT GACACATTTA 540 

TCATGCCTTC ATATTACATA TTAACTGAAA CXTAATTAATA AACATATGAA ATATCCATTG 600 

15 CACAAGGCAA AGGCACCTAA ACCTTTTGTT TCTTTTTCTA CATAGCAGAA ATTGATTTTT ' 660 

TTTTTATTTT TTTAGGGGAA CCTATATAAT TATGACCCAG TGATGTCTTT TGGTGACTTA 720 

AGCTTATGAA TTCAGGTrAC AATTCAGTTG ATTCTAGATG GTTACTACCT TGAAAAGGAT 780 

20 

GTTGGTGCCT TATGTGACAC GAGCCAGAGC CTGCTGQGGA ATAAACAAAG CAGGTTTCAT 840 

GCCAACACCA ACTCGTAGCT TTAGTX3GGCA GATGGGGAGT GGTTCACAGA CTTCCCAAAA 900 

25 TGTGGGGGCT TTGGGATTTT CCACACCATC CrACX^TGTGT TGTTCATrCT TCCTCTTTTC 960 

ACACTCTTGG ATGGATWATT TGRAAATGGT, GRAAWVMMCY YYKRAATTTG CCCAATAGCC 1020 

V/TGRGCCACC ATTCTTWATG ACACCATAAC CAAATAGTTC CWTAATGTTG AAATATTAGA 1080 

30 

AACCTGTTAC CAGCCYKSMA KTWACCCWWA WTTTTCCCAT GTTTGTGGAA TTGATATTGA 1140 

AATAGCAGGG CTAAGGAATT ACTGGCAAGT TTTAGCCTGT GGGTAATACC TTAGGGTTAT 1200 

35 TTAAATATTT GTAATTTTAT TTAAATGTTC ATGAATGTTT GAAAGGAACA AAATTATCAG 1260 
GGATGGCTCT TTGCCATGGG TCTTATTTTC ACCCTCTTTT- CTGTAAGAAA AAAGAAGAAT- - - 1320- 

GTCTTAATGT ATTTTTAAAG TTTTTGGTAT AGTOTCTAAT TCCAATTTTA ATAAAAGTTT 1380 

TWTRTAAAAA AAAAAAAAAA AAAAA 1405 



40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGATTTTCT AGGACCCCAA AAAAAAAAAA AGGGNAAAAA AAACCCNCAA AACCANCCAA 60 

AACCCCAAAA AAAAAAAAAA TCCACAAAAA CAAAAAAACT ATAAAAAAGA AAGAATTAAA 120 

60 AACTTTCAGA GAATTACTAT TTACTTTATT AACTTACGGA TTTATTATAT AAATATATAT 180 
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TCACCTAGCA ACATATCTCT GCCGTCTCTC CTGCTCTCAT AATGAAGACA TAGCCGATTC 240 

^ TCTGCCCGGG CXXCTTGCTG ATGCTCCTCC GGGTCTGCGT CGGGCGTGGG TCTCTGGGGA 300 

CCCTCCAGAG GTGGAGGTGG GCTGATGGCC TGGCTGCCTG GTGGTTGATG GTTTTCCTCC 360 

CXXTTACCTTT TTCTTTTTGAG TTTATTCTGA TTGATTTTTT TTCTTGGTTT CTGGATAAAC 420 

10 CACCXrrCTGG GGACAGGATA ATAAAACATG TAATATTTTT AAGAAGGAAA AAAAAAAAAA 480 

AAAAAACTNG GGGGGGGCCC CGAA 504 



15 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRXPTICW : SEQ ID NO: 52: 

NAAGTATCTT GGCGAGTITA TTACAGAGGA CGATAAATGA TTCCATGTGG ATAGGGCATA 60 

ACATACAGAG AATGAGACTA TGCCAGAAAT GGGAGGAGGC ATTTGAAACA ACATGAGTAT 120 

30 

CTCAGGGACA GATGGATTGA TTCTGCTATT GGTAGGCCTG GAAGCAANGG TCAGAAGTAG 180 

CAAAAAATGG ATACCAAAAG CACTATIWGT CACCCAAGCT 7VAGTGGAATA GCTOGCCCAG 240 

35 TAGGAGAAAT GGAGGTTTTG CTCTACACTA AGTTCTCCAA CTCTTGATAA GCCTCCAAAA 300 

ACAAATGTTA GGGGAAAAAA'AOSCAGCTOG TTATGAAAAG M 360 

AAATCAATGT CAATGCTGTT AATAGAATCC TTTTATCTTC AGGACAGAGG CAATGCCCTA 420 

40 

AACAAACACC AGCTCAAGAG CCTCTGATGC CAACCTAGAG GGTACCCAAA CACAAACTTA 480 

GCATAGAGGT AAGAATCTCT ATGTCTTTTG GTGGAGGCAA AGCCATTTGG TTGGTACTTC 540 

45 ACAGGAACAT CTTTCTACCA AGTCTTCATC ATATGGTATG TGCCACGAGT CTCCAGTTGT 600 

TTGCACCACT GTGTCATAGC TGAGAATACG CTGAAAGGTT AGTTTTGATC CTGGAAACCT 660 

ATTTACAATT GCCAGCTGAT GTCCCTGCTG CCACTTAAAA AAGGCTTGQG TCTGGCATAG 720 

GCAGAMAGGC CTGTGGTCCC CTCGTGCCGA TTCTNGGCTC GAGGCCAATT NCCTTAT 777 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEJTCTH: 602 hdiSB pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) ~aoPOU)GY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 53: 

5 

ATGACTACAG TGTTATACCC TCCAATCTTT GCAGGTQGGC ATGGAACACT GCTTCTATCA 60 

CTCTGTGCAC GGTATAAATC CATATATCCA CAAAAACACA CATCCATCCA TCAACATATA 120 

10 CATGGTTTGG GATGAGCAGG TCAATAGTTT TGAGAGGGAG TTTGTTCCTT TTTTTTTTCT 180 

CATTATACTC TTAAATTCTT GTCAGTTATC AAACAAACAA ACAGAAAAAT TGTTTGGAAA 240 

AACCTTGCAT ACGCCTTTTC TATCAAGTGC TTTAAAATAT AGACTAAATA CACACATCCT 300 

15 

GCCAGTTTTT TCTTACAGTG ACAGTATCCT TACCTGCCAT TTAATATTAG CCTCGTATTT 360 

TTCTCACGTA TATTTACCTG TGACTTGTAT TTGTTATTTA AACAGGAAAA AAAACATTCA 420 

20 AAAAAAGAAA AATTAACTGT AGCGCTTCAT TATACTATTA TATTATTATT ATTATTGTGA 480 

CATTTTGGAA TACTGTGGAA GTTTTATCTC TTGCATATAC TTTATACGGA AGTATTACGC 540 

CTTAAAAATA CGAAAATAAA TTTTACAAGG TTCCGGTTTT GGTGGTGGAA AGAGTAAATT 600 

GA 602 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1749 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGVT: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

AGTCACTGAC TTGGAGCCGC TCGGGGGAAG TCCCGCCCAG ACAGGCGGTG GGTGGGAATG 60 

CCTCACTTCA GTTTGAAGAG GGTCCGGATC CAAAGGGGTT AAAACGAGCG AACCCCGATC 120 

45 CCCGACCACA CTTCCCGCCT CCCTAAAACG CACACCCCGC TAGCCATGGG CAGCCGCGAC 180 

CACCTGTTCA AAGTGCTGGT GGTGGGGGAC GCCGCAGTGG GCAAGACGTC GCTGGTGCAG 240 

GATTATTCCC AGGACAGCTT CAGCAAACAC TACAAGTCCA CGGTGGGAGT GGATTTTGCT 300 

50 

CTGAAGGTTC TCCAGTGGTC TGACTACGAG ATAGTGCGGC TTCAGCTGTG GGATATTGCA 360 

GGGCAGGAGC GCTTCACCTC TATGACACGA TTGTATTATC GGGATGCCTC TGCCTGTGTT 420 

55 ATTATGTTTG ACGTTACCAA TGCCACTACC TTCAGCAACA GCCAGAGGTG GAAACAGGAC 480 

CTAGACAGCA AGCTCACACT ACCCAATQGA GAGCCGGTGC CCTGCCTGCT CTTCGCCAAC 540 

AAGTGTGATC TGTCCCCTTG GGCAGTGAGC CGGGACCAGA TTGACCGGTT CAGTAAAGAG 600 

60 
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AACGGTTTCA CAGGTTGGAC AGAAACATCA 
ATGAGAGTCC TCATTGAAAA GATGATGAGA 
5 ACCCAAGGGG ACTACATCAA TCTACAAACC 
TGTTTGGCTT ATTTTCCATC CCAGTTCTGG 
CCXACXTOAC CATTTTATTA AGTACATTTG 

10 

GGGCCCATTG TCACTTAGAA AAGACACCTG 
TTAGCCTTTC ACATGTTGCT GRCTCACATT 
15 TTCTCATCAG CCCTCAATTT GTGATCCGGA 
TGCGTTTTAG AGATCATAAT TCTCACCTAC 
TTGATATCAT GACTTCCAAT TGAGAGGAAA 

20 

TGTAGGCXX3T TGTTTCAGAT TCTTTCTGTC 
AGAAQGAGGG GTCTGGGCAT CTGTGGATTT 
25 GTATITTTGA AACTTCTAAC GTCATAATTA 
CAAGTTTTTT GGCCX3GGCAT GGTGGCTCAT 
GGCAGGCGGA TCACATGAGG CCAGGAATTC 

30 

CGTCTCTACT AAAAGTACAA AAATTAGCCA 
TACTCTGGAG ACTGAGGTGG GAGAATCGCT 
35 CGAGATCATG CCACCGCACT TCAGCCTGGG 
AAAAAAAAAA AAAACTCGAG GGGGGGCCCG 
ACAATCNAA 

40 



276 

GTCAAGGAGA ACAAAAATAT TAATGAGGCT 660 

AATTCCACAG AAGATATCAT GTCTTTGTOC 720 

AAGTCCTCCA GCTGGTCCTG CTGCTAGTAG 780 

GAGGTCTTTT AAGTCTCTTC CCTTTGGTTG 840 

AATTGTCTCC TGACTACTGT CCAGTAAGGA 900 

GAACCCATGT GCATTTCTGC ATCTCCTGGA 960 

AGTGCCAGTT AGTGCCTTCG GTGTAAGATC 1020 
ATTTTGTGAG AAGGATTAGA AATCAGCACC ■ 1080 

TTCTGAGCTT ATTTTTCCAT TTGATATTCA 1140 

ATGAGATCAA ATGrCATTTC CXIAAATTTCT 1200 

1TGGAATGTA AACATCTGAT TCTQGAATGC 1260 

TTGGCTACTA GAAGTGTCCC AGAAGTCACT 1320 

AGTTTCTCTT GTCTTGGCAT CAAGAATAGT 1380 

GCCKGTAATC CCAGCACTTG GGGAGGCCAA 1440 

GAGACCAACC TGGTCAGCAT GGCAAAACCC 1500 

QGCGTGATGG CACGTGTCTG TAATCCCAGC 1560 

TGAGACTGQG AGGCAGAGGT TGCAGTGAAC 1620 

TGACAGAGAA GGACTCCGTC TCAAAAAAAA 1680 

GTACCCAAAT CGCCSTOATA CTGATCCTAW 1740 

1749 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
50 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

AAAGAGATGG GCTCTTTATT TTCTCGAAAA ACCAATTTGG AGTTACTCAT TTTTCCATAA 60 

CATTAAATTT CTTACAGTGA ACTACATATT GTCCATAAGT GCTTCATCAG GACTCATCGC 120 

CCTCCTGTCT ACTGGCTCCA AATAGACCAT GTCAGCTTCA CCCCCTGGCT TTGTGTCTAT 180 

60 GGGTGGCCTG TGGTATATGG AAAAGTAGCA GGGTGGTCAG GGTGGGAGAC ACAAGATGTT 240 
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TTTATAGTCT AGAGCCTTTA AAAAACCCAG CAGAATGTAA ITCAGTATTT G^TOATTOGC 300 

^ TGTTmTGA CAGATTGTTG AAATTAAATG AATTGAAAGG GAAACTCAGA GTACTAGGAC 360 

GTITATTAAA AGGAAAAAAA TGTCTTGCAA TGTGCTGTAA TCACAAGAGG AGAAAATAAC 420 

TTGTTTCCTT GATCTGTCAG AGGTCACAGT AACCTGGGCC GAGCTGTTAT TATTTATTAT 480 

10 ATAATAGTAG TAGGAAGTTA ATAACTGGTT CTCTC3TGTTC CAAGCACAAT ATTACAACIT 540 

CTTTTGAACC GTAAATATCA GAATGAATCC tCTTCCCAGG GGATTGAACA GAAGCTTAAT 600 

GTTTACAAGT GTTTGAATTT GTGATCTGAA ATAACACAAA ATTAAAAACA TCATTTCTCT 660 

AATTTTCCAA CTAGAGGAAG AGAAACTTGT GGAAAAGTTC TTTTTTTTIC TrrrriTl ' lT 720 

CTTAAAGAAG GGCAGCCAAG GTAGTAACCT AAAAATAGTG CCCAGGCATA TCAGAGTTGT 780 

20 CCTACGAGGT TAAAGAACAC ACTGTTCCAC TGTATGGCTT TGGCCCTGAG TCGCCAGGGA 840 

GGTCAACTIG ACCCTGCCAT GTTGGTTTGA CTTACTAAGA CACAGGAATC ATrGTTTTCC 900 

TTGACCAGGG TCTCACACCC TGGAGGAATG TTAAGTAAGA GAAAGAACCT CTTTCCTCAA 960 

TATTGACATG TAAAAGACCA AAGTAATTTT TCTGAACTTC TQCAATTCTG AGAACTCTCC 1020 

AAGGAATTTA CAGTGATTTT AGTGCTTGTC AGCATTTTTC CATGAGGACT TTCATACATT 1080 

30 TGACTCTTTA GTTCACAGGT TCCCATTGAT TGTGAGCAAG ATATTTATCT CTTTAGCCCT 1140 - 

TGGGGATCCA GCTGAGAGCA ATCTCTTGCA TTTTTTTACC CGTGTATCTA CAGATATCAT 1200 

TTCTTGTGTA TGCCATGACT TGAAAAAGTT TGGGAAGCTC TTTAGCAATA TCAGCTAAAA 1260 

35 

GGATATGAAA TCACAGGTGA TAGCAGTTGT CATTCAGTAA OTTCCTACAA GCAGCACCCC 1320 

AAAGGAAATA TAGTCCTAAT CTTTACTATC CACTTCTAAA TTTAATGT6A ATTTCATACA 1380 

40 TGTTATTAGT TGTTTTCTTT ATAATTTTAT AAAAATTATT CATCGGGAC?r TTAACTTCCA 1440 

CTTCCATGCT ATCGGATGTG TTGGGCTCCA TGCAAGAACT TQGAAGAAAA ACAGGCAGGA 1500 

ATGCATTTGC ATAATGACCC AGATCATCAT TTTCTGCAAC TGAGAATTAT ATTTCATCAT 1560 

45 

TGCTTCTAGA AGTCTGCAAT TCTTTACTTT TCTTTGGTGC ATTATTATCT AGGTGCCATC 1620 

ACTGGATAAT GTGGAGTGAC TAGAGAAGTC AYATATCACT GTAAGGTACA GTTAGGGGTA 1680 

50 ACACTTTAGA GGTTTATTAT TTTTAAAAAA CTTTTCTTGA ACTCCTGGGC CAACATGGGT 1740 

GAAACCCCX3T CTTCTTACTT AAAAATACCC AAAATTAGGC CAGGGGCGTG GATGGGTGGG 1800 

GTGCCTGTTA ATCTTCAGCT ACTTNGGGGA GGGCTTGAAG CCAGGGAGGA ACTGCCCTX3G 1860 

ANCCCXIGGGG NGGGCCAGNA GGTTTGCCAG TTGAGT 1896 



55 



60 
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(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TCTTTTTAAA ATAGACATTT GTGGGGCTCA CACAATATAT GAAATAGTAC CCTCTAAAAA 60 

AGAGAAAAAA AAAATCAGGC GGTCAAACTT AGAGCAACAT TGTCTTATTA AAGCATAGTT 120 

TATTTCACTA GAAAAAATTT AATATCAAGG ACTATTACAT ACTTCATTAC TAGGAAGTTC ' 180 

TTTTTAAAAT GACACTTAAA ACAATCACTG AAAACTTGAT CCACATCACA CCCTGTTTAT 240 

TTTCCTTAAA CATCTTGGAA GCCTAAGCTT CTGAGAATCA TGTGGCAAGT GTGATGGGCA 300 

GTAAAATACC AGAGAAGATG TTTAGTAGCA ATTAAAGGCT GTTTGCACCT TTAAGGACCA 360 

GCTGGGCTGT AGTGATTCCT GGGGCCAGAG TGGCATTATG TTTTTACAAA ATAATGACAT 420 

ATGTCACATG TTTGCATGTT TGTTTGCTTG TTGAATTTTT GAACAGCCAG TTGACCAATC 480 

ATAGAAAGTA TTACTTTCTT TCATATGGTT TTTGGTTCAC TGGCTTAAGA GGTTTCTCAG 540 

AATATCTATG GCCACAGCAG CATACCAGTT TCCATCCTAA TAQGAATGAA ATTAATTTTG 600 

TATCTACTGA TAACAGAATC TGGGTCACAT GAAAAAAAAT CATTTTATCC GTCTTTTAAG 660 

TATATGTTTA AAATAATAAT TTATGTGTCT GCATATTGCA GAACAGCTCT GAGAGCAACA 720 

GTTTCCCATT AACTCTTTCT GACCAATAGT GCTGGCACCG TTGCTTCCTC TTTGGGAAGA 780 

GGAAAGGGTG TGTGAACATG GCTAACAATC TTCAAATACC CAAATTGTGA TAGCATAAAT 840 

AAAGTATTTA TTTTATGCCT CAGTATATTA TTATTTAATT TTTTAGGTAA TGCCTATCTC 900 

TTGGTCTATT AAGGAAAGAA GCAATCAGTA GAGAATTCAG GATAGTTTTG TTTAAATTCT 960 

TGCAGATTAC ATGTTTTTAC AGTGGCCTGC TATTGAGGAA AGGTATTCTT CYATACAACT 1020 

TGTTTTAACC TTTGAGAACA TTGACAGAAA TTATGCAATG GTTTGTTGAG ATACGGACTT 1080 

GATGGTGCTG TTTAATCAGT TTGCTTCCAA AGTQGCCTAC TCAAGAGGCC CTAAGACTGG 1140 

TAGAAATTAA AAGGATTTCA AAAACTTTCT ATTCCTTTCT TAAACCTACC AGCAAACTAG 1200 

GATTGTGATA GCAATGAATG GTATGATGAA GAAAGTTTGA CCAAATTTGT TTTTTTGTTG 1260 

TTGTTGTTGT TTTGAATTTG AAATCATTCT TATTCCCTTT AAGAATGTTT ATGTATGAGT 1320 

GTGAAGATGC TAGCGAACCT ATGCTCAGAT ATTCATCGTA AGTCTCCCTT CACCTGTTAC 1380 

AGAGTTTCAG ATCGGTCACT GATAC?rATGT ATTTCTTTAG TAAGAATGTG TTAAAATTAC 1440 

AATGATCTTT TAAAAAGATG ATGCAGTTCT GTATTTATTG TGCTGTGTCT GGTCCTAAGT 1500 
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GGAGCCAATT AAACAAGTTT CATATGTATT TTTCCAGTGT TGAATCTCAC ACACTCTACT 1560 

TTGAAAATTT CCTTCCATCC TGAATAACGA ATAGAAGAGG CXATATATAT TGCCTCCTTA 1620 

5 TCCTTGAGAT TTCACTACCT TTATGTTAAA AGTTGTGTAT AATTGTTAAA ATCTGTGAAA 1680 

GAATAAAAAG TGGATTTAAA TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1740 

AAAAAAAAGG GGG I753 



15 



(2) INFORMATION FOR SEQ ID NO: 57: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTO: 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
20 (D) TOPOLOGY: linear 



25 



(xi) SEQUENCE DESCRIPTIC»a: SEQ ID NO: 57: 

GCGGAAGTTA CTGCAGCCGC GGTGTTGTGC TGTGGGGAAG GGAGAAGGAT TTGTAAACCC 60 

CGGAGCGAGG TTCTGCTTAC COGAGGCCGC TGCTGTGCGG AGACCCCCGG GTCAAGCCAC 120 

CGTCATCATG TCTGACCAGG AGGCAAAACC TTCAACTGAG GACTTGGGGG ATAAGAAGGA 180 

30 AGGTGAATAT ATTAAACTCA AAGTCATTGG ACAGGATAGC AGTGAGATTC ACTTCAAAGT 240 

GAAAATGACA ACACATCTCA AGAAACTCAA AGAATCATAC TGTCAAAGAC AGGGTGTTCC 300 

AATGAATTCA CTCAGGTTTC TCTTTGAGGG TCAGAGAATT GCTGATAATC ATACTCCAAA 360 

35 

AGAACTGGGA ATGGAGGAAG AAGATGTGAT TGAAGTTTAT CAGGAACAAA CGGGGQGTCA 420 

TTCAACAGTT TAGATATTCT TTTTATTTTT TTTCTTTTCC CTCAATCCTT TTTTATTTTT 480 

40 AAAAATAGTT CTTTTGTAAT GTGGTGTTCA AAACGGAATT GAAAACTGGC ACCCCATCTC 540 

TTTGAAACAT CTGGTAATTT GAATTCTAGT GCTCATTATT CATTATTGTT TGTTTTCATT 600 

GTGCTGATTT TTGGTGATCA AGCCTCAGTC CCCTTCATAT TACCCTCTCC TTTTTAAAAA 660 

45 

TTACGTGTGC ACAGAGAGGT CACCTTTTTC AGGACATTGC ATTTTCAGGC TTGTGGTGAT 720 

AAATAAGATC GACCAATGCA AGTGTTCATA ATGACTTTCC AATTGGCCCT GATCTTCTAG 780 

50 CATGTGATTA CTTCACTCCT GGACTGTGAC TTTCAGTC3GG AGATGGAAGT TTTTCAGAGA 840 

ACTGAACTGT GGAAAAATGA CCTTTCCTTA ACTTGAAGCT ACTTTTAAAA TTTGAGGGTC 900 

TGGACCAAAA GAAGAGGAAT ATCAGGTTGA AGTCAAGATG ACAGATAAGG TGAGAGTAAT 960 

GACTAACTCC AAAGATGGCT TCACTG/^AGA AAAGGCATTT TT^AGATTTTT TAAAAATCTT 1020 

GTCAGAAGAT CCCAGAAAAG TTCTAATTTT CATTAGCAAT TAATAAAGCT ATACATGCAG 1080 

60 AAATGAATAC AACAGAACAC TGCTCTTTTT GATTTTATTT GTACTTTTTG GCCTCGGATA 1140 
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TGGGTTTTAA ATQGACATTG TCTGTACXIAG CTTCATTAAA ATAAACAATA TTTGTAAAAA 1200 
TCAWAAAAAA AAAAAAAAAA 1220 



(2) INFORMATION FOR SEQ ID NO: 58: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTO: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TCGCGCCTGC AGACACAGCA TCTACTCAGC GTGGGTCACC TCTGTGAACA TCACTCACTC 60 

20 

CAAGCCTCCC TCAATTTCTG GTGCAGCCCA TCAGGGACCC ACAGCGCCTG GGAGGATCGT 120 

GCGGATCTTG GCCAATGGGG AAATCGTGCA GGACGACGAC CCCCGAGTGA GGACCACTAC 180 

25 CCAGCCACCA AGAGGTAGCA TTCCTCGACA GAGCTTCTTC AATAGGGGCC ATGGTCCTCC 240 

CCCAGGGGGT CCTGGCCCCC GCCAGCAGCA GGCAGGTGCC AGGCTGGC3TG CTGCTCAGTC 300 

CCCCTTCAAT GACCTCAACC GGCAGCTGGT GAACATGGGC TTTCCGCAGT GGCATCTCGG 360 

30 

CAACCATGCT GTGGAGCCGG TGACCTCCAT CCTGCTCCTC TTCCTGCTCA TGATGCTTGG 420 

TGTTCGTGGC CTCCTCCTGG TTGGCCTTGT CTACCTGGTG TCCCACCTGA GTCAGCGGTC 480 

35 ACCTCTGAGG GCTGATAGGG GTGGGTTTGT TGAGAGGGAC TTGCTGGGCC TTGGTGTGAG 540 

AGGAGGCATA TTTGGAGGGG ATCTGGTGGT GCCTTGAAGG TATGATCAGA GAGGGGACCA 600 

CAGGTGTGTG TTTCCCCTTT GTGTTAAGCG TGAGGCAGAG GGAGACGTTA GTCCCAGCAT 660 

40 

TTCCCAAAGT GTGGGTGGGT CCGTTGGTTC CCGAGATACT TTTAGGTGGT ATGGGGCCTG 720 

CATTAAGTGG CACAAAATCA GAGCAAGAAA GCGATGCCCT TCCCAATTCT CTCAATCCTT 780 

45 TTATGCCGAG AAGATCTCAG CTGGATGCCA ACATGTTCCG ATGCCTGTGG AAGACATGCC 840 

GACGTCTCCT CTGCCTAGGG AGCAGGACTT GGGCTTAGGG CAGGTGGAAA AAATTCCAGA 900 

CTTTTTTAGC ACTGTTTTTG TTTTAATGGT ATATTTTTAT TGGCTACTTT ATTGTTTAGG 960 

50 

ACAAGTGGTA GrGGCATTCT ATTTATTGTG ACCTTTTCAA TAAATAGATT TAAGTAAAAA 1020 

AAAAAAAAAA AAAACTCGAG GGGGGGCCC 1049 

55 

(2) INFORMATION FOR SEQ ID NO: 59: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAGAGGATG TGMAGCTAGA GGTCCCCGAT GGCTGGTCGG ATGGGAAGCA CAAGGCTCAG 60 

GGACTGGATT GTAAAGGCAC TAAGTCGTTC TGCGGTGAGA ATCAGACATG GGGGACCTCT 120 

AGCTTCACAT CCTCTTTCCT TGCAGSTCTG GACATCCTGA GCCCAAGTCC CCCACACTCA 180 

GTC3CAGTGAT GAGTGCGGAA GTGAAGGTGA CAGGGCAGAA CCAGGAGCAA TTTCTGCTCC 240 

TAGCCAAGTC GGCCAAGGGG GCAGCGCTGG CCACACTCAT CCATCAGGTG CTGGAGGCCC 300 

CTGGTGTCTA CGTGTTTGGA GAACTGCTGG ACATGCCCAA TGTTAGAGAG CTGGCTCAGA 360 

GTGAdTTGC CTCTACCTTC CGGCTGCTCA CAGTGTTTGC TTATGGGACA TACGCTGACT 420 

ACTTAGCTGA AGCCCGGAAT CTTCCTCCAC TAACAGAGGC TCAGAAGAAT AAGCTTCGAC 480 

ACCTCTCAGT TGTCACCCTG GCTGCTAAAG TAAAGTGTAT CCCATATGCA GTGTTGCK3G 540 

AQGCTCTTGC CCTGCGTAAT GTGCGGCAGC TGGAAGACCT TGTGATTGAG GCTGTGTATG 600 

CTGACGTGCT TCGTGGCTCC CTGGACCAGC GCAACCAGCG GCTCGAGGTT GACTACAGCA 660 

TCGGGCGGGA CATCCAGCGC CAGGACCTCA GTGCCATTGC CCGAACCCTK AANAAAAACC 720 - 

ATTAAAGTTA CGACGGCAGC AGCAGCCGCA GCCACATCTC AGGACCCTGA GCAACACCTC 780 

ACTGAGCTGA GGGAACCAGC TCCTGGCACC AACCAGCGCC ASCCAGCAAG AAAGCCTCAA 840 

AGGGCAAGGG GCTCCGAGGG ANCGCCAAGA TTTGGTCCAA GTCGAATTGA AAGRACTCTC 900 

GTTTCCTCCC TGGGGATGTG GQGTCCCAGC TGCCTGCCTG CCTCTTAGGA GTCCTCAGAG 960 

AGCCTTCTGT GCCCCTGGCC AGCTGATAAT CCTAGGTrCA TGACCCTTCA CCTCCCCTAA 1020 

CCCCAAACAT AGATCACACC TTCTCTAGGG AGGAGKCAAA TGTAGGTCAT GTTTTTGTTC 1080 

GTACTTTCTG TTTTTTGTGA CTTCATGTGT TCCATTGCTC CCCGCTGCCA TGCTCTCTCC 1140 

CTTGTTTCCT TAAGAGCTCA GCATCTGTCC CTGrTCATTA CATGTCATTG AGTAGGTCGG 1200 

TAGCCCTGAT GGGGGTCGCT CTGTCTGGAG CATAACCCAC AGGCGTTTTT TCTGCCACCC 1260 

CATCCCTGCA TGCCTGATCC CCAGTTCCTA TACCCTACCC CTGACCTATT GAGCAGCCTC 1320 

TGAAGAGCCA TAGGGCCCCC ACCTTTACTC ACACCCTGAG AATTCTGGGA GCCAGTCTGC 1380 

CATGCCAGGA GTCACTGGAC ATGTTCATCC TAGAATCCTG TCACACTACA GTCATTTCTT 1440 

TTCCTCTCTC TGGCCCITGG GTCCTGGGAA TGCTGCTGCT TCAACCCCAG AGCCTAAGAA 1500 

TGGCAGCCGT TTCTTAACAT GTTGAGAGAT GATTCTTTCT TGGCCCTGGC CATCTCGGGA 1560 

AGCTTGATGG CAATCCTGGA AGGGTTTAAT CTCCTTTTGT GAGTTTGGTC GGGAAGGGAA 1620 
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The tissue distribution and homology to IgE receptor indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for allergy and 
asthma. 



5 FEATURES OF PROTEIN ENCODED BY GENE NO: 159 

The translation product of this gene shares sequence homology with 
immunoglobin heavy chain which is thought to be important in immune response to the 
antigen. 

This gene is expressed primarily in activated neutrophil and to a lesser extent in 

10 activated T cell, monocyte and heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: infection , inflanmiation 
and cancer. Similarly, polypeptides and antibodies directed to these polypeptides are 

15 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., blood cells, and heart, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

20 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to immunoglobin heavy chain variable 
region indicate that polynucleotides and polypeptides corresponding to this gene are 

25 useful for making the ligand to block specific antigen which cause certain disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 160 

The translation product of this gene shares sequence homology with mouse X 
inactive specific transcript protein which is thought to be important in X chromosome 
30 inactivation. 

This gene is expressed primarily in HSA172 cell and to a lesser extent in normal 
ovary tissue, ovarian cancer, frontal cortex and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: ovarian tumor, 

schizophrenia and other neurological disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
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differential identification of the tissue(s) or ceil type(s). For a number of disorders of 
the above tissues or cells, particularly of the inunune and neural system, expression of 
this gene at significandy higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., ovary and other reproductive tissue, and brain and other 

5 Ussue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

10 The tissue distribution and homology to X inactive specific transcript protein 

indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of reproductive system tumors and CNS tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 161 

15 This gene is expressed primarily in adipose cell and to a lesser extent in liver 

and prostate. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: obesity and liver 

20 disorder. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the adipose cell, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., adipose cells, liver, and 

25 prostate, and cancerous and wounded tissues) or bodily fluids (e.g., semm, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment of obesity and liver disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 162 

The translation product of this gene shares sequence homology with yeast 
35 ubiquitin activating enzyme homolog which is thought to be important in protein 
posttraslation processing. 
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This gene is expressed primarily in stromal cell and to a lesser extent in retina, 
H. Atrophic Endometrium, colon carcinoma and myeloid progenitor cell. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions: defects of stromal cell 
development, neuronal growth disorders and tumors. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inmiunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune and neural system, expression 
10 of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., retinal cells, endometrium, colon, and bone marrow, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
15 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to ubiquitin-activating enzyme homolog 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis or treatment of some type of tumors , fucosidosis and neuronal growth 
disorders. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 163 
This gene is expressed primarily in primary breast cancer and 
hemangiopericytoma and to a lesser extent in adult brain and cerebellum. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer, leukemia 
and cerebellum disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

30 particularly of the immune system and neural system , expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., mammary tissue, brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

35 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of various tumors and 
disease involved in neural system. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 164 

The translation product of this gene shares sequence homology with proline rich 
proteins. Recently, another group has also cloned this gene, calling it CD84 leukocyte 
antigen, a new member of the Ig superfamily. (See Accession No. U82988, see also. 
Blood 90 (6), 2398-2405 (1997).) 

10 This gene is expressed primarily in Weizmann olfactory tissue and 

osteoclastoma and to a lesser extent in anergic T-cell. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: ostsis and inmiune 

15 disease. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
inunune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., olfactory tissue, bone, and 

20 blood cells, and cancerous and wounded tissue:s) or bodily fluids (e.g., semm, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

25 The tissue distribution and homology to the Ig superfamily indicate that the 

protein product of this clone is useful for treatment of osteoporosis, autoimmune 
disease, and other immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 165 

30 This gene is expressed primarily in atrophic endometrium and colon cancer and 

to a lesser extent in some fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors. Similarly, 

35 polypeptides and antibodies directed to these polypeptides are useful in providing 

inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system. 
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expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., endometrium, colon, and fetal tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
5 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors, specifically 
endometrium and colon tumors. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 166 

This gene is expressed primarily in human primary breast cancer and to a lesser extent 
in activated monocyte. Although the predicted signal sequence is identified in Table 1, 
other upstream sequences are also relevant. Preferred polypeptide fragments comprise 

15 the amino acid sequence: WQPKlil^ASMGGSVElPFSF^YPWELAXXPXVRISWRRGHFHG 
QSFYSTRPPSIHKDYV^^U-FL^^WTEGQESGFLRISm-RKEDQSVYFCR\^ELDT^ (SEQ ID 
NO: 641), as well as N-terminal and C-terminal deletions. Also preferred are 
polynucleotide fragments encoding these polypeptide fragments. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the inunune system, 

25 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., mammary tissue, and blood cells, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

30 healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 167 

35 This gene is expressed primarily in fetal tissues and to a lesser extent in adult 

lung. This gene has also been mapped to chromosomal location 9q34, and thus, can be 
used as a marker for linkage analysis for chromosome 9. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 

5 tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the embryo tissues, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., fetal 
tissues, and lung, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

10 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 168 

15 The translation product of this gene shares sequence homology with Ig Heavy 

Chain which is thought to be important in inmiune response. 

This gene is expressed primarily in prostate cancer tissue specifically 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: prostate cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
prostate, expression of this gene at significantly higher or lower levels may be routinely 

25 detected in certain tissues and cell types (e.g., prostate, tissue and cells of the inunune 
system, and cancerous and wounded tissues) or bodily fluids (e.g., seram, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from £Ui 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

30 disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 169 

The translation product of this gene shares sequence homology with cytosolic 
acyl coenzyme-A hydrolase, which is thought to be important in neuron-specific fatty 
35 acid metabolism. The gene represented by this contig has since been published by Hajra 
and colleagues (GenBank Accession No. U91316). 
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GGGTATATAG ATTGTATTAA AAAAAAAAAG GTATATATGC ATATATCTAT ATATAATATG 1680 

ACGCAGAAAT AAATCTATGA GAAATCTATC TACAAAMWAA AAAAAAAAAA AAAAAAAAAA 1740 

AGGAATTCGA TNTCAAGCIT ATCGATACCG TCNACC 1776 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTTH: 443 base pairs 
J 5 (B) TYPE: nucleic acid 

(C) STRANDEENE5S_:,;|U>uble . , 

(D) TOPOLOGY: linear 



20 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ACAGATAAAT AAATAAATAA TAAATTAAAT TAAATAAAAA ATCTGAGCTA ATCTGAATAA 60 

ATTGAGAGAT TTCACATGAA AGCCAGGATT TCTGCCTTCC CAGGAACAGT CAGAAGAGCT 120 

25 AGCTAGCAAC ACTOGTCTGC TTGGCTACCT TCTTTGGAAC AACATGAAAT CTAGCTCCCT 180 

riT i T r iT rr r rri TGOCCC acttcatcca ttcacatgac ctgcctggcc tctgcaggta 240 

AGTTGAGTATG CAACAAAAAT GTAGCACAGG TrTTGTCGCT GAACTACGTG GTTTCAGGTC 300 
CAGCTCTCCC ACTTGCTAGC ATCACCTCGT GCCGAATTCC NGCACGAAGT TTTTTTrrrr 360 
irrrrcAGTG CTCCAGTCCC CCTATTOGAG AATCCTGCCC CCCCCTQGGA CAGAATGTTC 420 

443 

35 ACCCTGGCCC CGCGANTCCC TGA 

40 (2) INFORMATION FOR SEQ ID NO: 61: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2888 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : double 

(D) TOPOUOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

50 TTAATOTTGr CAATAACCAC CAGGCCAAAC AGAATTTATA TGACCTGGAT GAAGATGATG 60 

ATGGTATAGC TTCCGTICCT ACTAAACAGA TGAAGTTTGC AGCCTCAGGC GNCTTTCTCC 120 

ACCACATGGC TGGGCTAAGC AGTTCCAAGC TTTCCATGTC CAAGGCCCTC CCTCTCACCA 180 

AAGTQGTTCA GAATCATCCA TACACAGCTC CTGCTCTCCC TTCCTCTATT CGAACAAAAG 240 

CCTTGACCAA CATOTCCCGG ACACTGGTGA ACAAGGAAGA ACCCCCCAAA GAGCTGCCAG 300 

60 CTCCTGAGCC TCnTCTCAGC CCAriGGAAG GCACCAAGAT GACTGTGAAT AATCTGCACC 360 
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TCCAGGGGTC AGTCTGTCAG GCAGGAAGGA CCCAGGATTT GAACCXAGCT TCAGTGTGCA 2220 

GGCTCTGAGG CTGCCCAGGA CGGGAAAGTC CAAGGAAGGG GCCTGGTQGT GCTCCACTTG 2280 

5 

CAGriXl-lTTA AAGAATGCTG CTTTTTATTC TCCTAACCCT TTCAAGTGGG TGCAGACTTC 2340 

TCGTTAGCAG CTGGAAGACA TTCCTCCCAC ACTTTTCCCT TCCTGGCCCA AGAGAGCATC 2400 

10 CAGAAGGCAG TAGGACCTGG TTTTTCAGGT ACTGGGAGCC GGGGGCTCAC TGCTTGCACT 2460 

GTGCTTAGGG TAGGGATGGT AAATATCCTC CCTGCATGGC TTTATCCTCC CTCTCATCCC 2520 

AAAGCAGGTA TCTTCTGGTT GTCACAGAGT TTCATTGAGT CX^WXTTGCAG CXACGTGGCX 2580 

15 

ATCTQGAGCT GGTGCTATAG GTGACCATCT OGTACATTGA GGGGACXTTCT TTGCCTCCTC 2640 

CACTCTATAA GCAGTCATCT TGGGAGACCG GGAGGAGAAG GTGGTGGGCT AGTCCTGTGT 2700 

20 CCTCCTCCAC TTCCCATGCC TCTATGTTAC CX::ATCTGTGT CTCCTGTGCA GAAGGAGAGG 2760 

AAGGGGCATT AAGAGATGAA GGGTGATTAT GTATTACTTA TCCATTTCTG AATAAACATT 2820 

TGTTATTCCT AAAAAAAAAA AAAAAAAACT CGAGGGGGGG CCCGGWACCC AWATCGCCSK 2880 

AAAGTGAG 2888 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LErraTH: 1851 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: "linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CACTAGTATA ATTTATAATT ATAACCTATT CTGATTTCTT TTCAAATATT AGGTGTCCTA 60 

GTTGCCTATG AAGGTTTGCC ACTTCATCTT GCACTGTTCC CCAAAdTTG GACTGAGCTA 120 

45 TGCCAGACTC AGTCTGCTAT GTCAAAAAAC TGCATCAAGC TTTTGTGTGA AGATCCTGTT 180 

TTCGCAGAAT ATATTAAATG TATCCTAATG GATGAAAGAA CTTTTTTAAA CAACAACATT 240 

GTCTACACGT TCATGACACA TTTCCTTCTA AAGGTTCAAA GTCAAGTGTT TTCTGAAGCA 300 

50 

AACTGTGCCA ATTTGATCAG CACTCTTATT ACAAACTTGA TAAGCCAGTA TCAGAACCTA 360 

CAGTCTGATT TCTCCAACCG AGTTGAAATT TCCAAAGCAA GTGCTTCTTT AAATGGQGAC 420 

55 CTGAGGGCAC TCGCTTTGCT CCTGTCAGTA CACACTCCCA AACAGTTAAA CCCAGCTCTA 480 

ATTCCAACTC TGCAAGAGCT TTTAAGCAAA TGCAGGACTT GTCTGCAACA GAGAAACTCA 540 

CTCCAAGAGC AAGAAGCCAA AGAAAGAAAA ACTAAAGATG ATGAAGGAGC AACTCCCATT 600 

60 
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40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3542 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEENESS: doiible 

(D) TOPOLOGY: linear 



AAAAGGCGGC GTGTTAGCAG TGATGAGGAG CACACTGTAG ACAGCTGCAT CAGTOACATC 660 

AAAACAGAAA CCAGGGAGGT CCTGACCCCA ACGAGCACTT CTGACAATGA GACCAGAGAC 720 

TCCTCAATTA TTGATCCAGG AACTGAGCAA GATCTTCCTT CCCCTGAAAA TAGTTCTCTT 780 

AAAGAATACC GAATGGAAGT TCCATCTTCG TTTTCAGAAG ACATGTCAAA TATCAGGTCA 840 

CAGCATGCAG AAGAACAGTC CAACAATGGT AGATATGACG ATTGTAAAGA ATTTAAAGAC 900 

CICCACTGTT CCAAGGATTC TACCCTAGCC GAGGAAGAAT CTGAGTTCCC TTCTACTTCT 960 

ATCTCTGCAG TTCTGTCTGA CTTAGCTGAC TTGAGAAGCT GTGATGGCCA AGCTTTX3CCC 1020 

15 TCCCAGGACC CTGAGGTTGC TTTATCTCTC AGTTGTGGCC ATTCCAGAGG ACTCTTTACT ' 1080 

CATATGCAGC AACATGACAT TTTAGATACC CTGTGTAGGA CCATTGAATC TACAATCCAT 1140 

GTCGTCACAA GGATATCTGG CAAAGGAAAC CAAGCTGCTT CTTGACATTA GG1CTAGCAT 1200 

20 

GTCTACTTTT AAGTCCCTCA CCCCCAACCC CCATGCTGTT TGTATAAGTT TTGCTTATTT 1260 

GTTTTTGTGC TTCAGTTTGT CCAGTGCTCT CTGCTTGAAT GGCAAGATAG ATTTATAGGC 1320 

25 TTAATTCTTG GTCAGGCAGA ACTCCAGATG AAAAAAACTT GCATCTTCAG TATACTTCCT 1380 

AAAGQGCAAT CAGATAATGG ATATGTTTTA TGTAATTAAG AGTTCACTTT AGTOGCTTTC 1440 

ATTTAATATG GCTGTCTGGG AAGAACAGGG TTGOCTAGCC CTGTACAATG TAATTTAAAC 1500 

30 

TTACAGCATT TTTACTGTGT ATGATATGGT GTCCTCTGTG CCAGTTTTGT ACCTTATAGA 1560 

GQCAGATTGC CTCCGATCGC TGTGGTTCTT ATTATCAAAA TTAAGTTTAC TTGTATACGG 1620 

35 AACAACCACA AGAAATTTGA TTCTGTAAAG AATCCTCTTT AGCTGTGGCC TGGCAGTATA 1680 

TAAATGGTGC .TTTATTTAAC AGAATACCTG -TGGAGGAAAT AAAGGAGACT TGATCTAAAA 1740 

ATAATTGTTT TATTTTTATT GACATGACTG ATTGATTGCT ATTCTGTGCA CTTAATTAAA 1800 

CTGATTGTGA TGACTTWWAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 1851 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TCCAATGCTG ATGAGCGTCT TCGCTGGCAG GCCAGCTCCT TGCCTGCTGA TCACCTTTGC 60 

ACAGAAAATG CCATCATGCT GAAACGATTC AATAGGTATC CGCTGATCAT TGACCCCTCT ' 120 

60 GGACAGGCCA CAGAATTCAT TATGAATGAA TATAAGGWTC GTAAGATCAC ACGGACCAGC 180 
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TTCCTGGATG ACGCCTTCAG AAAGAACTTA GAGAGTGCAC TGAGATTCGG TAACCCCCTT 240 

^ CTGGTCCAGG ATGTGGAAAG CTACX3ATCCA GTTTTGAACC CX3GTGCTGAA CCXmSAAGTG 300 

CGGCGAACAG GGQGGAGAGT GCTGATCACT CTCGGGGACC AGGACATAGA CCTGTCGCCA 360 

TCGTTTGTCA TCTTCCTCTC CACCCGGGAT CCAACTGTCG AGTTCCCACC AGATCTCTCT 420 

10 TCCCGGGTTA CTTTTGTAAA CTTCACAGTT ACCCGTAGCA GTTTACAAAG CXIAGTGTCTA 480 

AATGAAGTAC TTAAAGCAGA AAGACCTGAT GTGGACX3AGA AACGATCTGA TCTTCTTAAA 540 

CTTCAAGGGG AATTTCAGCT CCGTTTGCGT CAGCTGGAAA AATCTCTACT ACAAGCTCTC 600 

AACGAGGTGA AAGGQCGCAT TTTGGATGAC GACACGATCA TAACCACTCT GGAGAACCTC 660 

AAGAGAGAGG CTGCAGAGGT CACCAGGAAA GTTGAGGAGA CGGACATTGT CATGCAGGAG 720 

20 GTGGAGACCG TGTCCCAGCA GTACCTCCCG CTCTCCACCG CXTTGCAGCAG CATCTACTTC 780 

ACCATGGAGT CCCTCAAGCA GATACACTTC TTGTACCAGT ACTCCCTCCA GTTTTTCCTO 840 

GACATTTATC ACAACGTCCT ATACGAGAAC CCGAACCTGA AGGGTGTCAC CGACCACACA 900 

CAGCGCCTGT CCATTATAAC AAAGGACCTC TTCCAGGTGG CGTTTAACCG AGTGGCTCGA 960 

GGCATGCTGC ATCAGGACCA CATTACCTTT GCCATGCTGC TGGCAAGAAT CAAACTGAAG 1020 

30 GGCACCGTGG GGGAGCCCAC CTACGATGCA GAATTCCAGC ACTTCTTGAG AGGAAATCAG 1080 

ATTGTCCTGA GTGCTQGCTC CACCCCCAGG ATCCAGGGCC TGACTGTGGA GCAGGCGGAG 1140 

GCGGTGGTGA GGCTGAGCTG CCTTCCCGCG TTTAAGGACT TGATTGCAAA GGTTCAGGCA 1200 

35 

GACGAGCAAT TTGGCATCTG GCTGGACAGC AGCTCCCCGG AGCAGACTGT GCCCTACCTC 1260 

TGGAGTGAAG AAACACCTGC AACACCCATT GGCCAGGCCA TCCACCGCCT GCTCCTGATC 1320 

40 CAGGCTTTCC GGCCCGATCG CCTGTTGGCC ATGGCCCACA TGTTTGTTTC AACAAACCTT 1380 

GGGGAGTCTT TCATGTCCAT CATGGAGCAG CCGCTCGACC TGACCCACAT TGTOGSCACA 1440 

GAGGTGAAGC CCAACACTCC TGTCTTAATG TGCTCTGTGC CTGGTTATGA TGCCAGTCGA 1500 

45 

CATGTCGAGG ACCTTGCAGC CGAGCAGAAC ACGCAGATCA CTTCAATTGC AATCGGCTCT 1560 

GCAGAAGGCT TTAACCAAGC AGATAAGGCA ATAAACACCG CTGTAAAGTC GGGCAGGTGG. 1620 

50 GTGATGCTGA AGAATGTGCA TCTGGCCXrCA GGGTGGCTGA TGCAGCTQGA GAAGAACTTC 1680 

CATTCCCTGC AGCCGCATGC CTGCTTCCGA CTCTTCCTCA CTATGGAGAT CAACCCCAAG 1740 

GTGCCTGTGA ATCTGCTCCG TGCGGGCCGC ATCTTTGTGT TCGAGCCACC GCCAGQGKTG 1800 

AAGGCCAACA TGCTGAGGAC GTTCAGCAGC ATTCCCGTCT CACGGATATG CAAGTCTCCX: 1860 

AACGAGCGTG CCCGCTTGTA CITCCTGCTG GCCTGGTTTC ATGCGATCAT CCAAGAACGC 1920 

60 TTACGATACG CACCACTGGG GTGGTCAAAG AAGTATGAAT TTGGAGAGTC TGACCTGCGG 1980 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



10 


AGGTGATTTT 


AATGATAGGT GTCATATATA GGACGGATAA TCTGTTTACA TTCTG'riX^Tr 


60 


CTCGATGCAC 


TCACAAGCGG GTAACTAGGT GACAAGAAAA CAAAGATCTT ATTCAAAAGA 


120 




GC5TCTTACAG 


CAACCCAACG TCTCATCTTC CCATAGTAAA GATGACGGCG CCTTCAOGTA 


180 


15 


AGCTACAGGC 


AACACCACTT CCGCGTTTCT CTTGCGCCCT GGTCCAAGAT GGCGGATGAA " 


240 




GCCACGCGAC 


GTGTTGTGTC TGAGATCCCG GTGCTGAAGA CTAACGCCGG ACCCCGAGAT 


300 


20 


CGTGAGTTGT 


GGGTGCAGCG ACTGAAGGAG GAATATCAGT CCCTTATCCG GTATGTOGAG 


360 


AACAACAAGA 


ATGCTGACAA CGATTGGTTC CGACTGGAGT CCAACAAGGA AGGAACTCGG 


420 




TGGTTTGGAA 


"'^AVJ'Vi-xvaLyi A J./\XV.w\X\a/U. ^XWl\aAAAA A±\mVj1 1 Lxifii CATCGAGTTT 


480 


25 


GACATTCCTA 


TCACATATCC TACTACTGCC CCAGAAATTG CAGTTCCTGA GCTGGATGGA 






AAGACAGCAA 


AGATGTACAG GGGTGGCAAA ATATGCCTGA CGGATCATTT CAAACCTTTO 


600 




TGGGGCCAGG 


AATGTGCCCA AATTTGGACT AGCTCATCTC ATGGCTCTGG GGCTGGGTCC 


660 


ATGGSTGGCA 


GTGGAAATCC CTGATCTGAT TCAGAAGGGC GTCATCCAAC ACAAAGAGAA 


720 




ATGCAACCAA 


TGAAGAATCA AGCCACTGAG GCAGGGCAGA GGGACCTTTG ATAGGCTACG 


780 


35 


ATACTAWTTT 


CCTGTGCATC ACACTTAACT CATCTAACTG TTCCCCGGAC ANCCTCCACT 


840 




CTAGTTGTTA 


CTAAGTANTG CAGTAGCATT NTGGGGAAGA ACA 


883 


40 










(2) INFORMATION FOR SEQ ID NO: 65: 




45 


(i) 


SEQUENCE CHARACTERISTICS: 






(A) LENGTH: 1541 base pairs 








(B) TYPE: nucleic acid 








(C) STRANDEDNESS: double 








(D) TOPOLOGY: linear 




50 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 






GGCACGAGGT 


GGCCTCTACC CTGGGCTCAT CTGGCTACAC AGGGACTCTA TU^GCTTCCA 


60 


55 


GATTCCCTGG AAACATGCCA CCCGGCATAG CCCTCAACAA GAAGAGGAAA ATACCATTTT 


120 


TAAGGCCTGG 


GCTGTAGAGA CAGGGAAGTA CCAGGAAGGG GTGGATGACC CTGACCCAGC 


180 




TAAATGGAAG 


GCCCAGCTGC GCTGTGCTCT CAATAAGAGC AGAGAATTCA ACCTGATGTA 


240 


60 


TGATGGCACC 


AAGGAGGTGC CCATGAACCC AGTGAAGATA TATCAAGTGT GTGACATCCC 


■ 300 
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JCCCAG 


GGCTCGATCA TTAACCCMG ATCCACAGGG TCTCXTTCCCT C3GGATCAGAA 


360 


TAATGAT 


GTGGATGAAG 


AAGATGAGGA AGATGAGCTG GATCAGTCGC AGCACCATCT 


420 


. -.TCCAG 


GACACCTTCC 


CCTTCCTGAA CATCAAOXSGT TCTCCCATGG CGCCAGCCAG 


480 


^GCAAT 


TGCAGTGTGG 


GCAACTQCAG CCCGGAGGCA GTGTGGCCCA AAACTGAACC 


540 


3AGATG 


GAAGTACCCC 


AGGCACCTAT ACAGCCCTTC TATAGCTCTC CAQAACTOTO 


600 


J^GCTCT 


CTCCCAATGA 


CTGACXrrGGA CATCAAGTTT CAGTACCGTG GGAAGGAGTA 


660 


2CAGACC 


ATGACCGTGA 


GCAACCCTCA GGGCTGCCX3A CTCTTCTATG GGGACCTOGG 


720 


ZATGCCT 


GACCAGGAGG 


AGCTCTTTGG TCCCGTCAGN CTGGAGCAGG TCAAATTCCC 


780 


rCCTGAG 


CATATTACCA ATGAGAAGCA GAAGCTGTTC ACTAGCAAGC TCCTOGACC?r 


840 


-^GACAGA 


GGACTGATCC 


TGGAGGTCAG CX3GTCATGCC ATTTATGCCA TCAGGCTCTC 


900 


3TGCAAG 


GTGTACTGGT 


CTGGGCCATG TGCCCCATCA CTTGTTGCTC CCAACCTCAT 


960 


::agacaa AAGAAGGTCA 


AGCTATTTTG TCTGGAAACA TTCCTTAGCX5 ATCTCATTGC 


1020 


. ;CAGAAA 


GGACAGATAG 


AGAAGCAGCC ACCGTTTGAG ATCTACTTAT GCTTTCGGGA 


1080 




GATGGGAAAC 


CATTGGAAAG GAAACTCATC TTGGTTCAGG TCATTCCAGT 


1140 


^GCTCGG 


ATGATCTACG 


AGATGTTTTC TGGTGATTTC ACACGATCCT TTGATAGrGG 


1200 


:<3TCCGC 


CTGCAGATCT 


CAACCCCAGA CATCAAGGAT AACATCGTTG CTCAGCTCAA 


1260 




CX3CATCCTTC 


AAACCCAGGA GAGCTGGCAG CCCATGCAGC CCACCCCCAG 


1320 


^ZAACTG 


CCCCCTGCCC 


TGCCTCCCCA GTAATTCTGA ATGCXIATCTT CTTCCTTCTC 


1380 


.:ITATAA 


TATTGTACAT 


ATGGATTTTT TTATTGTTTA GATTTAACCA GCTTTTAAAT 


1440 


, ■"-nTTTC 


TGTGACAGTG 


TTAGAAGTTT GTGATTCTCC AAATATGCCT AGATTTAAAG 


1500 


ATTTAAT 


TTATGGAAAA AAAAAAAAAA AAAAAAAAAA A 


1541 



INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUEJJCE DESCRIPTION: SEQ ID NO: 66: 
-ATGAA TGTTAGAAGG TGCCTGCCGA GGCGGGACAG AGTGTTTGCT CGCGCTQGAG 60 



:;CTGC TCAGCCCTGA GAGTCCCTTC CTGCCCCACC GATACTGGCA CTTTAAAAAG 



120 



7 
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GAAGCTGACC GCACAGTGTC CAGACGAATT GGCCXXXAGA AGATGGGGAG TTCTGTCCTG 180 

CCCTTCTGTG TCTGCGTGAC CTCACCCAGC CTAGGAGGGA GGTGCATTCA GGGTAGATTT 240 

5 GCCTCTCATT CAAAGTTCTG GGGCTTTGGG CGGAAAACAG CCAGCTTTGG CX3CTCTTGGG 300 

GAGACTCCTC CAGACCAC3GA ACCCCAGAAG GAGACAGAGC CTGCCACATC CTCCCACGCC 360 

AGGCCCTGGG CCAGGGTGAT TGGACTGAGA ATTTGGCCAC AACCAAATTG ATGCTGGCTG 420 

GAACXIAGAGG CCAGAAAGCC TOGCXTTTGrC CTCATGTGGG AGCCCTGTCC TCAGCCCTCT 480 

tgtccxx:ttg AGCTCAGTGA ATTCCCACCA GGTGCCCACA GCTCCTGGAC TTCAAATTCT 540 
15 ATATATTGAG AGAGTTGGAG AGTATATCAG AGATATTTTT GGAAAGGAGT TGGTCTATQC ' 600 

AATGTCAGTT TGGAATCTTC TTGAAAGTTT AATGTTTTTA TTAGGAGATT TAAAGAAAAT 660 

AAAGGTCTAC AATATCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 720 

AAAAAAAAAA AA 732 



10 



20 



25 



35 



55 



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 629 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTAAGGAATT CGGCMCGATC CCGGCAAGTA ACATGACTAA AAAGAAGCGG GAGAATCTGG 60 

GCGTCGCTCT AGAGATCGAT GGGCTAGAGG AGAAGCTGTC CCAGTGTCQG AGAGACCTGG 120 

40 AGGCCGTGAA CTCCAGACTC CACAGCCGGG AGCTGAGCCC AGAGGCCAGG AGGTCCCTGG 180 

AGAAGGAGAA AAACAGCCTA AT6AACAAAG CCTCCAACTA CGAGAAGGAA CTGAAGTTTC 240 

TTCGGCAAGA GAACCGGAAG AACATGCTGC TCTCTGTGGC CATCTTTATC CTCCTGACGC 300 

45 

TCGTCTATGC CTACTGGACC ATGTGAGCCT GGCACTTCCC CACAACCAGC ACAGGCTTCC 360 

ACTTGGCCCC TTGGTCAGGA TCAAGCAGGC ACTTCAAGCC TCAATAGGAC CAAGGTGCTG 420 

50 GQGTGTTCCC CTCCCAACCT AGTGTTCAAG CATGGCTTCC TGGCGGCCCA GGCCTTGCCT 480 

CCCTQGCCTG CTGGGGGGTT CCGGGTCTCC AGAAGGACAT GGTGCTGGTC CCTCCCTTAG 540 

CCCAAGGGAG AGGCAATAAA GAACACAAAG CTGAAAAAAA AAAAAAAAAA AACTCGTAGG 600 

GGGGGCCCGT ACCCAATCGC CCTTNTCGTG 629 



60 
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(2) INFORMATION FOR SEQ ID NO: 68: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOUDGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

10 

CTGCTAGCCG GCCGGCGCAG GCTGCCGAGC GGGTGAGCGC GCAGGCCAGG CCAAAGCCCT 60 
GGTACCCGCG CGGTGCGGGC CTCAGTCTGC GGCCATQGGG GCGTCCGCGC GGCTGCTGCG 120 
15 AGCGGTGATC ATGGGGGCCC CGGGCTCGGG CAAGGGCACC GTGTCGTCGC GCATCACTAC 180 



ACACTTCGAG CTGAAGCACC TCTCCAGCGG GGACCTGCTC CGGGACAACA TGCTGCGGQG 240 



CACAGAAATT GGCGTGTTAG CCAAGGCTTT CATTGACCAA GGGAAACTCA TCCCAGATCA 300 

20 j 

TGTCATGACT CGGCTGGCCC TTCATGAGCT GAAAAATCTC ACCCAGTATA GCTGGCTGTT 360 



GGATGGTTTT CCAAGGACAC TTCCACAGGC AGAAGCCCTA GATAGAGCTT ATCAGATCGA 420 
25 CACAGTGATT AACCTGAATG TGCCCTTTGA GGTCATTAAA CAACGCCTTA CTGCTCGCTG 480 



GATTCATCCC GCCAGTGGCC GAGTCTATAA CATTGAATTC AACCCTCCCA AAACTGTGGG 540 



CATTGATGAC CTGACTGGGG AGCCTCTCAT TCAGCGTGAG GATGATAAAC CAGAGACGGT 600 

30 

TATCAAGAGA CTAAAGGCTT ATGAAGACCA AACAAAGCCA GTCCTGGAAT ATTACCAGAA 660 



AAAAGGGGTG CTGGAAACAT TCTCCGGAAC AGAAACCAAC AAGATTTGGC CCTATGTATA 720 
35 TGCTTTCCTA CAAACTAAAG TTCCACAAAG AAGCCAGAAA GCTTCAGTTA CTCCATGAGG 780 



AGAAATGTGT ,GTAACTATTA-ATAGTAAGAT-GGGCAAAGCT GCTAGTCGTT GGATTTAGAA -840- 

GCTGCTTTTC CTAAGACTTC TAGTATGTAT GAATTCTTTG AAAATTATAT TACTTTTATT 900 

40 

TCTACTGATT TTATTTTGGA TACTAAGGAT GTGCCAAATG ATTCGGATAC TAAGATGCAT 960 



CGTTTGAAAT CATCTAGTGT GTTGTATGCA GTTATCCTCA AAAACATCAG CGATGTCTGA 1020 
45 ACCTTTAAAA CATCTGTTAG AGCAAAATTA AAAGAGCATT TGGTAGTAAT CTAACTTTTT 1080 



GTTCAGTTAA TAAGTGGTTG ATAAAGTTTC CATATTTTTC TGGAAAAGTT AAAAAAAGTT 1140 



ACATGTCATT TGGAGAAAAT ACGTAATCAG AAATTTGTGC ATAGATTGAT GCCAAAAAAG 1200 

50 

ACATTTCCAG CATTGTGGAA CATGGTGAGA CACTATATAA AATTCCAGAA AGAAAiSCAAC 1260 



TGGATTTACA GATTTATTGT GAGACACAAA TTCACTGCTG CCTTTACACT AAGAAATGTA 1320 
55 TATGTTAACC ATATATGCTG TATTTATTTT GTCGTTAAGC ATACTTTCAG TTTACTCAGA 1380 



ATTTTCAATT TGCTATAAAG ATGTATCAAT TAGCATATAG AAAAATATTA CTTTAAGATG 1440 



ACTTGTTTCC TTTGAAAATA CCTGTGTACT GAGGGTTATG ATTTGTGTCA AAAATTGACA 1500 

60 
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TAAGTGCTTT TACAAGCACC AAAGTTGAAT GAATTTTCAA CAAAATGTAA TTAAAGTCTA 1560 

TGTTTrCAGT TATGACTCAG GTTAAGAAAT GTGTrTTAGG ATCTACTTGC TGX3TTTTTCT 1620 

TTTTGATCCA AATGTGTCAT CTGCCCTGAT AAATAACAAG TTATNGTACC ATCTCCCXXX3 1680 

CCAATAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC CAATTCTCCX3 1740 

NAATAGGNAG T 1751 

(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TVTE: nucleic acid 

(C) STRANDEDNESS : doiible 
20 (D) TOPOIjOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GGCACGAGAT TATGTATTAA AATGTTTTTG AATTGTGAAA TATTAGAATA TTGTTACTAT 60 

TTGACCCAAC TCAAAATCTC CATQGGAAAA TACCTGTCGA TACCCACAGT ATTGTTGAAA 120 

ATAATCAGAT GCAGTATCAC AGCTGTGTCA GACTCTAGTA CCAGTTGGGC AATCAAGGCA 180 

30 CAGCTAAAAA TTGAAAACAA AGATCTQGAC AACAAAACAG CCAAAGGTGG GGGTCAAGAA 240 

GCTCTGACGl* GTACCTAGCT GTAGAATGCT ATGCACACGT GCCAGGTGTA GTGTGCATAT 300 

CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT CACCTCTGGT TGACCATGAG CTCTGTGTAA 360 

35 

GCAGGAAGTG AAGGCTAAGG CAGATTTAAG CTCTGAAAGC ATTCCACAAC ATACACACAA 420 

ATCGTGCAAA GCATTAAGGA AATCTTGTTA CTGCTAAGTG TTGCTGACCC AGGAACAACT 480 

40 CCTACTCAGC TGGACTTAAA AATAAAAA 508 



45 (2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
55 TACATAGAGC AAAGAGAAAT TTCCAGAATT TCTARAATTC TGGAAAGAGA ATTTTCCTGA 60 
GATTGCAGAT TTGCTTGTGT CCTCAGGTGA TGATGAGGGC TGTTTTCCCC TGTTGTCCTT 120 
TCCTCACACT CATGCTTCCT CTCCTAGAGT GTCTGGTTGG CATGATCATG TGCTACCTAG 180 

60 
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GCATTTCTTT CACTCATACA AGGAAAACTG CAGGGTTAAA AAAAAAAAAA AAAAAAAAAA 240 



NCNCG 



245 



<2) INFORMATION FOR SEQ ID NO: 71: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

ATGTTCCTCA TGAGGATGCA CTTGTGCTTC TGCAAGTATT GCTQCAGCTT CATAGTGACT 60 
20 CCCACCAGCA CCAGCAATAC AGCTAGCTAC CTCTGGCCTT GGATCTCAGC CAGCATOGCT 120 

C3GGAGAGGGA GCAGCTGGGC ATGTACCCTA AATGCTGTTA CCAGGGAAGG ACTCCCAGAG 180 
25 '^'^^'^^CAAG TAGGGA,CTTC CTGCAGAGGT GGTACATC5TG CTCTCTC3TAT CCATACTTTT 240 

TTTTTTTTTT TTTTGAGATA GAGTTTCACC CTTGTTGCCC TGGCTGGAGT GCAATCGTCC 300 

GATCTCAGCT CACTGCAACC TCTCTGCCTC CCGGGTTCAA GTGATTCTCC TGCCTCAGCC 360 
30 T 361 



35 (2) INFORMATION FOR SEQ ID NO: 72: 

( i ) SEQUENCE CHARACTERISTICS ; - . _ _ _ - - - 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDECaJESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

45 AGGATCACAC AATAGAGAAC ACTGTAGTAA CATTTCGGTC TGCTCACAAG ACCCAGAACA 60 

TTGATCAGTT tTTGTTGTTG GTTTATTATT TTTCTGTTAA AAAATTGTGA AAAGTTTGTT 120 

TTAGCTAGAT GATATTTTAA TAGCTGCGAG TGCTTTGGAA CTATAAAGAT GTCACTAdT 180 

AACACACATA CCTTATGTTT TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG 240 

TTAGCCAACC ATCTAGCATT TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGATGT 300 

55 TCCCATAACA GCAACAAAAC AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTGT 360 

AAGCACCTGC TTAACTTTGT GTCCCAAATA TTTAGTGTGT ATATATATAT ATATATATAC 420 

ACACACACAC ACATATATAT TCAACAAATA AAGCAAAATA TAACATGCAT TTCACATTTT 480 

60 
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GTCTTTCCCT GTTACGATTT TAATAGCAGA ACTGTATGAC AAGTTTAGGT GATCCTAGCA 540 
TATGTTAAAT TCAAATTAAT GTAAAACAGA TTAACAACAA CAAAGAAACT GTCTATTTGA 600 
5 GTGAAGTCAT GCTTTCTATT ATAATAACTT GGCTTCGGTT ATCCATCAAA TGCACACTTA 660 
TACTGTTATC TGATTGTTTA TAATAAAGAA TACTGTACTT ATAAAAAAAA AAA 713 



10 



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 862 bcise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEZ3NESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GAAAGTCAGA GCTGTCCAAT CCCTCAGCAC CTTTTAGATT TGCTCCAAAT TAGAAACGTG 60 

GGGACTATGT GTTCTGGGCA ATCACAGGTC TGGAAAATGG CTCTGCAGGC TCTTGATAGT 120 

25 

GAGACAC3TGG TCATCTTACC AGACATGCAT CTGATTTTAA GCCTCAGGCT AATCCACAAT 180 

GCTCGGCCAT GCCTATGATT AACAAACAAA AGCAAAATCT GCTTTTATAG TTTAGGAAAC 240 

30 CTGGATAGAA CAGTATTTTT CAGCATTCTT GGATAAAGCA GTTCTGCATT TTTAAATTQG 300 

GACTGCAGAA GTGACTGTCT ATAGTTGTGA AATACAAAAA ATGGTATGTT TGATCAGAAA 360 

AGGAAGCCCG TGCCTGGCAC TTGGAAAGAT ACTGAGCATC ATAACCCTAA TC5AGAAAATG 420 

35 

TAGGCTCTGT GAATGTTAAC TACAAATCAG GTTAGGAAAG CATATGACAC CCTTTGTCAA 480 

ACTAAGCTTC ACTAGGAGGA CCTGTGCTCA TAGAAGAATA TGCTTTAAAA GTATCAATTT 540 

40 TCCACAGTCG ATGATGGAGA AAAGTTCATT TGCACCAGAA TGCTGATAGT CACAATACAC 600 

AGCCTGACAT ATATAACAAT ACAGTTTTCT GTAAACAGAA GTTCTTCCTC TTCCAATTCA 660 

GGAGTCAGTC AGAGCATAAA TATTGCATGT TTCACTTTAG AAACTGATTC ATTTTAGAAA 720 

45 

GCAGATCTGG ATTATTTTGC AGGGTAGAAA TGAAGGCTAT TTCTGGCATT CTTGCTCAAA 780 

AAGTCAATAT ATGTACATTA AGTATAAAAA AGGGTCTCTT TCACCTCTTT TGTTTCGTAG 840 

50 CATTGGCTAC ATAACTCGTG CC 862 



55 (2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4602 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDBNESS: double 
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(P) TOPOLOGY: linear . . 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GCGAGGGGGC GKGGGGAGCA GCGCCGARGC CGCCGCCTCC GCCTCCGCCG CCTAGGACTA 60 

GGGGGTGGGG GACGGACAAG CCCOGATGCC GGGGGAKACG GAAGAGCCGA GACCCCCGGA 120 

GCAGCAGGAC CAGGAAGGGG GAGAGGCGGC CAAGGCGGCT CCGGAGGACC CGCAACAACG 180 

GCCCCCTGAG GCGGTCGCGG CGGCGCCTGC AGGGACCACT AGCAGCCGCG TGCTGAGGGG 240 

AGGTCGGGAC CGAGGCCGGG CCGCTGCGRC CGCCGCGCMG CAGCTGTGTC CCGCCGGAGA 300 

AGGCCGAGTA TCCCCGCCGG CGAGGAGCAG CCCCAGCGCC AGGCCTCCCG ACGTCCCCGG" 360 

GCAGCAGCCC AGGCCGCGAA GTCCCCGTCT CCAGTTCAGG GCAAGAAGAG TCCGCGACTC 420 

CTATGCATAG AAAAAGTAAC AACTGATAAA GATCCCAAGG AAGAAAAAGA GGAAGAAGAC 480 

GATTCTGCCC TGCCTCAGGA AGTTTCCATT GCTGCATCTA GACCTAGCCG GGGCTGGCGT 540 

AGTAGTAGGA CATCTGTTTC TCGCCATCGT GATACAGAGA ACACCCGAAG CTCTCGGTCC 600 

AAGACCGGTT CATTGCAGCT CATTTGCAAG TCAGAACCAA ATACAGACCA ACTTGATTAT 660 

GATGTTGGAG AAGAGCATCA GTCTCCAGGT GGCATTAGTA GTGAAGAGGA AGAGGAGGAG 720 

GAAGAAGAGA TGTTAATCAG TGAAGAGGAG ATACCATTCA AAGATGATCC AAGAGATGAG 780 

ACCTACAAAC CCCACTTAGA AAGGGAAACC CCAAAGCCAC GGAGAAAATC AGGGAAGGTA 840 

AAAGAAGAGA AGGAGAAGAA GGAAATTAAA GTGGAAGTAG AGGTGGAGGT GAAAGAAGAG 900 

GAGAATGAAA TTAGAGAGGA TGAGGAACCT CCAAGGAAGA GAGGAAGAAG ACGAAAAGAT 960 

GAGAAAAGTC-CACGTTTACe CAAAAGGAGA ~AAAAAGCCTC tZAATCCAGTA TGTCCGTTGT 1020 

GAGATGGAAG GATGTGGAAC TGTCCTTGCC CATCCTCGCT ATTTGCAGCA CCACATTAAA 1080 

TACCAGCATT TGCTGAAGAA GAAATATGTA TGTCCCCATC CCTCCTGTGG ACGACTCTTC 1140 

AGGCTTCAGA AGCAACTTCT GCGACATGCC AAACATCATA CAGATCAAAG GGATTATATC 1200 

TGTGAATATT GTGCTCGGGC CTTCAAGAGT TCCCACAATC TGGCAGTGCA CCGGATGATT 1260 

CACACTGGCG AGAAGCATTA CAATGTGAGA TCTGTGGATT TACTTGTCGA CAAAAGGCAT 1320 

CTCTTAATTG GCACATGAAG AAACATGATG CAGACTCCTT CTACCAGTTT TCTTGCAATA 1380 

TCTGTGGCAA AAAATTTGAG AAGAAGGACA GCGTAGTQGC ACACAAGGCA AAAAGCCACC 1440 

CTGAGGTGCT GATTGCAGAA GCTCTGGCTG CCAATGCAGG CGCCCTCATC ACCAGCACAG 1500 

ATATCTTGGG CACTAACCCA GAGTCCCTGA CGCAGCCTTC AGATGGTCAG GGTCTTCCTC 1560 

TTCTTCCTGA GCCCTTGGGA AACTCAACCT CTGGAGAGTG CCTACTGTTA GAAGCTGAAG 1620 

GGATGTCAAA GTCATACTGC AGTGGGACGG AACGGGTGAG CCTGATGGCT GATGGGAAGA 1680 
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TCTTTGTGGG AAGOGGCAGC AGTGGAGGCA CTGAAGGGCT GGTTATGAAC TCAGATATAC 1740 

TCGGTGCTAC CACAGAGC5TT CTGATTCAAG ATTCAGACTC TGCXXX3ACCT TAGTGGACAG 1800 

GAAGACTTGG GGCATGQGAC AGCTCAGACT TTGTATTTAA AAGTTAAAAA GGACAAAAAA 1860 

AAAATCTAAA GCATTTAAAA TCTAGTX3AAA TAACTGAAGG GCCTCCTCTT TCCATTGTGG 1920 

ATCACAGCAC ACACATACAT ACACCCTCCA CCTCCCCATC CXXTOTTCTC CCTCTGTTGC 1980 

TCCCXTTTATA AAATTGATGT TGTCTTTACC AGAAAGGTAG ACAAAAAAGA AGCAGCAGCA 2040 

GCTCTTAAAG TGAGGGTTAT TCTCATACTC GGTTCCAGCC ATCAGCAGAC TTCCTGCTCA 2100 

TCGGCAGATC CCCCTTTCCA ACCTGTAACT CTGATGTGCT CTGGATCAGC TTTTAACTTT ' 2160 

TAATCATATA TTACTGTCTT CTAAATCCCT TCTCCTCCTC TACTGCTGCC CTATCGTTCT 2220 

GGCTCCTACC CCCTGCGGCA CACTTATCTT CAAATACCAT AGAATTGTAA TCTCTGAAAT 2280 

CATAGCTCTC CAGTGGCTTT TAAAGAAAGC TGGTCCTCAG CACTAACAAA ATCACTACAA 2340 

TAGCCTAC3TG CTTTTTTGGA AGCCITTTTA GGGAAGAATG TTAGGTTCAT GGTAACTAGT 2400 

ATGCTCTTTG AGATTTTTAC AGTCTTGAAA CTTAAGAATT TTGAGAGGCT GAGGAGGGTT 2460 

GTTCAGAATC TAAATTACAG ATAGAOXSATT GOTTCTTGTG AATTTGTTTC TTTTCCTTTT 2520 

TTTTTGTCCC TACCATTTCC TTACATTTCC CTTGGGGCXC ATCTCTGGCT CCTTGCTTTT 2580 

TGTTTCTTGC TTTGCTTTAT CAGTTCATTC CAGCTCCCTG TTAGTGAAGG ACACTGCTGT 2640 

TAGTGAAGGA ACAAAGTCTA TGAGTCCTAA AATTTTAAGT CAAAGAAAAC TGCTCTGTTT 2700 

CCCCTTTAGT AACACTTCTG AAGAGGAAAA ACTTCAATAG CCAAAGTTAA TAATCCTATA 2760 

TAATAATTGC TTTGGCTTTC ACCTAAAATT CTGQGCATCA CAATTTCCTT GGGATAGAGG 2820 

TTGTGTTGGG GAATAGATTG CTTATTGCTG TTCACTGGAG AGAAAAGGTA GTGTTTTTCT 2880 

ACAAGGTCAT ACCGCCAGAA GCXXZCAAATC CTATTTTGGC TCATCTTCAG C3TAAAGAGTA 2940 

ATTCCTATCC TGTGTGCCTC AGAAGCTAGA ATCGAAGGCT TACCCTATTC ATTGTTtATT 3000 

GTCAGAAATG CATGATGGCT CTTGGAAAGA ATGACGTTTT GCTGGAAAAA AAAAAAARAA 3060 

CMGTTTGTGT TTCACAAACA TGGCTTATCA ATTTTTTCAA AGAATTCTTT TTTCCCAAAA 3120 

AGAGGAGTAA CAAAATGTCA TTTCTGAAAG AGGCTTACTT TATACCAACT AGTGTCAGCA 3180 

TTTGGGATGC CAGGGAACAG AGAGTGAGAC ACCTACAATC ACCAGTCTCA AATGCX3CTAT 3240 

TGTTTCTTTT CAGAGTGTTG CAGATTTGCC ATTTCTCCAT AATATGGGGA TAGAAAATGG 3300 

AATAAAGATA GAAGGGATGT AGAATATGCT TTCCTGCXAA CATGGTTTGG AGTCGACTTT 3360 

GGTATATTGA CTAGATTTGA AAATACAAGA TTGATTAGAT GAATCTACAA AAAAGTTGTC 3420 

CTCCTCTCAG GTCCCTTTTA CACTTTTTGA CTAACTAGCA TCTATATTCC ACACTTAGCT 3480 
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TTTTTGTCAC ACTTATCCTT TGTCTCCGTA AATTTCATTT GCAGTQGTTA GTCATCAGAT 3540 

ATTTTAGCCA CCTACACAAA AGCAAACTGC ATTTTTAAAA ATCTTTCTGA GATGGGAGAA 3600 

5 AATGTATTCT CCTTTCCTAT ACCGCTCTCC CAACAAAAAA ACAACTACTTT AGTTCTACTA 3660 

ATTAGAAACT TGCTCTACTT TTTCTTTTCT TTTAGGGGTC AAGGACCCTC TTTATAGCTA 3720 

CCATTTGCCT ACAATAAATT ATTGCAGCAG TTTGCAATAC TAAAATATTT TTTATAGACT 3780 

10 

TTATATTTTT CCTTTTGATA AAGGGATGCT GCATAGTAGA GTTGGTGTAA TTAAACTATC 3840 

TCAGCCGrrTT CCCTC3CTTTC CCTTCTGCTC CATATGCCTC ATTCTCCTTC CAGGG/tfSCTC 3900 

15 TTTTAATCTT AAAGTTCTAC ATTTCATGCT CTTAGTCAAA TTCTGTTACC TTTTTAATAA ' 3960 

CTCTTCCCAC TGCATATTIC CATCTTGAAT TGGTGGTTCT AAATTCTGAA ACTGTAGTTG 4020 

AGATACAGCT ATTTAATATT TCTQGGAGAT GTGCATCCCT ci^l'Cl'i'lir m GTTGCCCAAG 4080 

20 

GTTGTTTTGC GTAACTGAGA CTCCTTGATA TGCTTCAGAG AATTTAGGCA AACACTQGCX: 4140 

ATGGCCGTGG GAGTACTGGG AGTAAAATAA AAATATCGAG GTATAGACTA GCATCCACAT 4200 

25 AGAGCACTTG AACCTCCTTT GTACCTGTTT GGGGAAAAAG TATAATGAGT GTACTACCAA 4260 

TCTAACTAAG ATTATTATAG TCTGGTTGTT TGAAATACCA •rmTTTCTC CTTTTGTGTT 4320 

TITCCCACTT TCCAATGTAC TCAAGAAAAT TGAACAAATG TAATGGATCA ATTTAAAATA 4380 

30 

TmATTTCT TAAAAGCCTT TTTTGCCTGT TGTAATGTGC AGGACCXOTC TCCTTTCATG 4440 

GGAGAGACAG GTAGTTACCT GAATATAGGT TGAAAAGGTT ATGTAAAAAG AAATTATAAT 4500 

35 AAAAGGGATA CTTTGCrTTT CAAATCTTTG TTTTCTCTTA TTCTAGGTAA GGCATATTAA 4560 

AAATAAATAT GTAAAGAAGA- AAAATAAA?\G TTGTCTTGAT GG - - - --46G2 - - 

40 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOliOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CGCGCCCCGG GCCGGCGGGT TTCTCTAACA AATAAACAGA ACCCGCACTG CCCAGGOGAG 60 

CGTTGCCACT TTCAAAGTGG TCCCCTGGGG GAGCTCAGCC TCATCCTGAT GATGCTGCCA 120 

55 

AGGCGCACTT TTTATTOTTA TTTTATTTTT ATTTTTTTTT TAGCATCCTT TTGGGGCTTC 180 

ACTCTCAGAG CCAGTnTTA AGGGACACCA GAGCCGCAGC CTGCTCTGAT TCTATGGCTT 240 

60 GGTTGTTACT ATAAGAGTAA TTGCCTAACT TGATTTTTCA TCTCTTTAAC CAAACTTGTG 300 
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GCCAAAAGAT ATTTGACCGT TTCCAAAATT CAGATTCTGC CTCTGCQGAT AAATATTTGC 360 

CACGAATGAG TAACTCCTGT CACCACTCTG AAGGTCCAGA CAGAAGGTTT TGACACATTC 420 

TTAGCACTGA ACTCCTCTGT GATCTAGGAT GATCTGTTCC CCCTCTGGAT GAACATCCTC 480 

TGATGATCAA GGCTCCCAGC AGGCTACTTT GAAGGGAACA ATCAGATGCA AAAGCTCTTG 540 

GGTGTTTATT TAAAATACTA GTCTCACTTT CTGAGTACCC GCCGCTTCAC AGGCTGAGTC 600 

CAGGCCTGTG TGCTTTGTAG AGCCAGCTGC TTGCTCACAG CCACATTTCC ATTTGCATCA 660 

TTACTGCCTT CACCTGCATA GTCACTCTTT TGATQCTGGG GAACCAAAAT GGTGATGATA 720 

TATAC3ACTTT ATGTATAGCC ACAGTTCATC CXTAACCCTA GTCTTCGAAA TGTTAATATT 780 

TGATAAATCT AGAAAATCCA TTCATACAAT TACAGAATTC AAATATTGCA AAAGGATGTG 840 

TGTCTTTCTC CCCGAGCTCC CCTGTTCCCC TTCATTGAAA ACCACCACGG TGCCATCTCT 900 

TGTGTATGCA GGGCTATGCA CCTGCAGGCA CGTGTGTATG CACTCCCCGC TrGTGTTTAC 960 

ACAAGCTGTG GQGTGTTACG CATGCCTGCT TTTTTCACTT AATAATACAG CTTGGAGAGA 1020 

TTTTTC5TATC ACATTATAAA TCCCACTCGC TCTTTTTGAT GGCCACATAA TAACTACTGC 1080 

ATAATATGGA TACGCCTTAT TTGATTTAAC TAGTTCCCTA ATGATGGACT TTTAAGTTGT 1140 

TTCCTTTTTT TTTCTTTTTT GCTACTGCAA ACGATGCTAT AATAAATGTC CTTATCAAAA 1200 

AAAAAAAAAA AT^AAAAAAAA AAAAAANCCC NGGGGGGGGG CCCCGGGAAC NCAAT 1255 
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(2) INFORMATION TOR SE^^ ID-NO: 76: " " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEITCTH: 475 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GGCACGAGAG AAATGTTTGA TTCTCTTTCC TATTTTAAGG GATCTTCTCT CTTGTTGATG 60 

TTGAAAACTT ACCTTAGTGA AGATGTGTTT CAACATGCTG TTGTCCTTTA CCTGCATAAT 120 

CACAGCTATG CATCTATTCA AAGTGATGAT CTGTGGGATA GTTTTAATGA GGTCACAAAC 180 

CAAACACTAG ATGTAAAGAG AATGATGAAA ACCTGGACCC TGCAGAAAGG ATTTCCTTTA 240 

GTGACTGTTC AAAAGAAAGG AAAGGAACTT TTTATACAAC AAGAGAGATT CTTTTTAAAT 300 

ATGAAGCCTG AAATTCAGCC TTCAGATACA AGGTACATGC CCTCTTTCTT TTCATGCCAT 360 

CTCTTTTGCA CTCTCAGGTG GAAATATTTT GAAGTGTTTT ATAATCATAA GTTCTTGTGA 420 
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AACCTAACAA GATTATCCCT TCCTAAGAAT ACTTAACCTT CCTACCAAAT TAAAA 475 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 
10 (B) TYPE: nucleic acid 

<C) STRANDEENESS: double 
(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 77: 

TTCTCTCTGC TCTTCGACTG CACCGCACTC GCGCGTGACC CTGACTCCCC CTAGTCAGCT 60 

CAGCGGTGCT GCCATGGCGT GGCGGCGGCG CGAACCRGCG TCGGGGCTCG CGGCGTOITC 120 

20 GCTCTGGCGT TGCTCGCCCT GGCCCTC5TGC GTGCCCGGGG CCCGGGGCCG GGCTCTCGAG 180 

TGGTTCTCGG CCGTGGTAAA CATCGAGTAC GTGGACCCGC AGACCAACCT GACGGTOK3G 240 

AGCGTCTCGG AGAGTGGCCG CTTCGGCGAC AGCTCGCCCA AGGAGGGCGC GCATX3GCCTC 300 

GTGGGCGTCC CGTGGGCGCC CGGCGGAGAM CTCGARGGCT KCGOGCCCGA CACGCGCTTC 360 

TTCGTGCCCG AGCCCGGCGG CCGAGGGGCC GCGCCCTGGG TCGCCCTCGT GGTCGTGQGG 420 

30 GCTGCACCTT TCAAGGACAA AGTGCTGGTG GCGGCGCNGA ANGAA 465 



35 (2) INFORMATION FOR SEQ ID NO: 78: 

{V. SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1907 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

45 ACATGCAGCC CAACTACAGA TTCTTATGGA ATTCCTCAAG GTTGCAAGAA GAAATAAGAG 60 

AGAGCAACTG GAACAGATCC AGAAGGAGCT AAGTGTTTTG GAAGAGGATA TTAAGAGAGT 120 

GGAAGAAATG AGTGGCTTAT ACTCTCCTGT CAGTGAGGAT AGCACAGTGC CTCAATTTCA 180 

AGCTCCTTCT CCATCACACA GTAGTATTAT TGATTCCACA GAATACAGCC AACCTCCAGG 240 

TTTCAGTGGC AGTTCTCAGA CAAAGAAACA GCCTTGGTAT AATAGCACGT TAGCATCAAG 300 

55 ACGAAAACGA CTTACTGCTC ATTTTGAAGA CTTGGAGCAG TGTTAdTTT CTACAAGGAT 360 

GTCTCGTATC TCAGATGACA GTCGAACTGC AAGCCAGTTG GATGAATTTC AGGAATGCTT 420 

GTCCAAGTTT ACTCGATATA ATTCAGTACG ACCTTTAGCC ACATTGTCAT ATGCTAGTGA 480 

60 
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TCTCTATAAT GGTTCCAGTA TAGTCTCTAG TATTGAATTT GACCGGGATT GTGACTATTT 540 

TGCGATTGCT GGAGTTACAA AGAAGATTAA AGTCTATGAA TATGACACTG TCATCCAGGA 600 

TGCAGTGGAT ATTCATTACC CTGAGAATGA AATGACCTGC AATTCGAAAA TCAGCTGTAT 660 

CAGTTOGAGT AGTTACCATA AGAACCTGTT AGCTAGCAGT GATTATGAAG GCACTGTTAT 720 

TTTATQGGAT GGATTCACAG GACAGAGGTC AAAGC5TCTAT CAGGAGCATG AQAAGAGGTG 780 

TTGGAGTGTT GACTTTAATT TGATGGATCC TAAACTCTTG GCTTCAGGTT CTGATGATGC 840 

AAAAGTGAAG CTGTCGTCTA CCAATCTAGA CAACTCAGTG GCAAGCATTG AGGCAAAGGC 900 

TAATGTGTGC TGTGTTAAAT TCAGCCCXTTC TTCCAGATAC CATTTGGCTT TCGQCTGTQC * 960 

AGATCACTGT GTCCACTACT ATGATCTTCG TAACACTAAA CAGCCAATCA TOGTATTCAA 1020 

AGGACACCGT AAAGCAGTCT CTTATGCAAA GTTTC3TGAGT GGTGAGGAAA TTGTCTCTGC 1080 

CTCAACAGAC AGTCAGCTAA AACTGTGGAA TCTAGGGAAA CCATACTCCC TACGTTCCTT 1140 

CAAC3GGTCAT ATCAATGAAA AAAACTTTGT AGGCXTTOGCT TCCAATGGAG ATTATATAGC 1200 

TTGTGGAAGT GAAAATAACT CTCTCTACCT OTACTATAAA GGACTTTCTA AGACTTTGCT 1260 

AACTTTTAAG TTTGATACAG TCAAAAGTGT TCTCGACAAA GACCGAAAAG AAGATGATAC 1320 

AAATGAATTT GTTAGTQCTG TGTGCTGGAG GGCACTACCA GATGGGGAGT CTAATGTGCT 1380 

GATTGCTGCT AACAGTCAGG GTACAATTAA GGTGCTAGAA TTGGTATGAA GGGTTAACTC 1440 

AAGTCAAATT GTACTTGATC CTGCTGAAAT ACATCTGCAG CTGACAATGA GAGAAGAAAC 1500 

AGAAAATGTC ATGTGATGTC TCTCCCCAAA GTCATCATGG GTTTTGGATT TGTTTTGAAT 1560 

ATTTTTTTCT TTTTTTCTTT TCCCTCCTTT ATGACCTTTG GGACATTGGG AATACCCAGC" ' 1620 

CAACTCTCCA CCATCAATGT AACTCCATGG ACATTGCTGC TCTTGGTGGT GTTATCTAAT 1680 

TTTTGTGATA GGGAAACAAA TTCTTTTGAA TAAAAATAAA TAACAAAACA ATAAAAGTTT 1740 

ATTGAGCCAC AGTTGAGCTT GGAAAGTTTT TGTCAAATGC NGCAAGAGAT AACTCTTTTT 1800 

ANGAAGTAGC ATATGTGAAC TATAATGTAA CAGTGAATAA TTTGTAAAGT TCGTATTTCC 1860 

CAACCTCTTT GGGAATTACA CATATCAATA TAAACAAAAT ATAAAGT 1907 



50 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

60 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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GCTGGGCTGT CCXTCKCSGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATCA 60 

CCCATGACTG ATGCTTATAA ATTTGTCAAA GGCAAACGAC CAATTATCTC CCCAAACCTT 120 

5 

AACTTCATGG GGCAGTTGCT AGAGTTCGAG GAAGACCTAA ACAAaXTTGT GACACCX5AGA 180 

ATCCTTACAC CAAAGCTGAT GGGCXTTGGAG ACGGTTGTGT GACAATGGTC TGGATGGAAA 240 

10 GGATTGCTGC TCTCCATTAG GAGACAATGA GGAAGGAGGA TGGATTCTGG TTTrrTTTCT 300 

TTCTTTTTTT TTTTGTAGTT GGGAGTAAGT TTGTGAATGG AAACAAACTT GTTTAAACAC 360 

TTTATriTTA ACAAGTGTAA GAAGACTATA ACTTTTGATG CCATTGAGAT TCACXMCCXA 420 

CAAACTGACA AATTAAGGAG GTTAAAGAAG TAATTTTTTT AAGCCAACAA TAAAAATATA 480 

ATACAACTTG TTTCTCCCCC TTTTCCTTTT AAGCTATTTG TAGAGTTTAT GACTAAATAG 540 

20 TCTGTGCAGG TTCATAGACC GAAGATACTA CACACTTTAA ACCAATTAAA AAGAACCAAA 600 

AGTAAATAGA AAAGACATTG AATCACCAAG GCCTGGGATC AACCTGGGCT GTCCACACAG 660 

AAAACAAAAA CCCAACCAAA CCAAGCCCTG TTGTGCTCAC TGGTGCAAAG AGAAGATCAG 720 

25 

GGCAGCTTAA GTGGTCTAAG RATCCTTCAG GCATTCTTTA AGGAGAAAAA GGATACCTTT 780 

GATTTTGTX3T GTTTCATGCT CTGGATTTTT TTTTTTTTTC CTTCTCTGGG TTTAAGAGAT 840 

30 TTTTTTTGAA ATAGTGAGGA ACTGACCATT ATATGCCTTC ACTGGCTTCT TGTGCAATAA 900 

TATGATGTTT TAAGTGTGCA AACAAGTTAG AGCTGGCAGC TGAATGATAG ACAAATAGTG 960 

CAAATTTGCC AGCTTGGAGA TAGAAAGGAA TTCAACAATA TATCAAATAC TTTCCTTCCC 1020 

35 

ACCTTTTTCC TTTTTTTTTT TTTTTTCTGA TTTGATTCTG GTTACAGTGC CATAAACXTTT 1080 

GTTACATATG TATATCAGAA TGTAAGAAAA AAAAATTTAT TTAAAAATAT TTTTCGCAAA 1140 

40 AAAAAAANNA AAAAACTCGA GQGGGGCC 1168 



45 (2) INFORMATION FOR SEQ ID NO: 80: 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1285 base pairs 
(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

(D) TOPOIXX3Y: linear 

(xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 80: 

55 AGAAAATCAC ATCCTAACAA AGAAGTCTGT CTAAGACAGT ACATCTCCTG TTGAACTTGC 60 

ATCTTTCCAC AGGACTTTCT GTTTTTAGGG ATGAGACTAT TCTCTGCTTC ATCAAGGAAA 120 

GAGAAATGTT CAGGGTTGTA GGGATGGCAC ACTTATTAGT TCTGCCTGTC TGAAAGGTTC 180 

60 
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CIGCAGGACA GTTTGGTCAG AGCTGCAATT CTTAGTCCAT 

CTTcrrrccc tttcctgtct caggaatcag ctgagaattc 

GCCCCTTACT GTGATTTGTT GGTTGCACTT TCATTTGCTT 

gactcctcag acttcaccta actttggaaa ctctcttttg 
aattctgtgc tgcctgagcc ctagaatttt cccaccaacg 
aagttgctgg atctagttga tatttaaaca atatctagtt 
atccagaaac cagtatctct naaaaacaac ctctcatacc 
gcgtctgtgt gtgcgcgcat atgtatatag acaggcacat 
cttatgcctc tttggtatct atatctgtga aagttttaat 
ggacctttgt cttctgtgta aatggtacta gagaaaacac 
gttggtttta ttcgacatga aggaaatttc cagataacaa 
actaggggga caaagaaaag caaaactgac cataaaaaac 

CATAAACAGA ATTAGGTAGT ATATTGAAGA CAGCATCATT 
TTGCAAAAAA CATGTACTGA CTTCCCGTTG AGTAATGCCA 
AACTTGCCCT TCATTACATG TTTCAAAGTG GTGTGGTGGG 
AACTGACTGA TAAAGCTGTA CAAATAAGCA GTGTGCCTAA 
ACATGCTTAA TTCACAAATG CTAATTTCAT TATAAATTGT 
AACTATTTTT CTGTATTCCA AGAGCTGAGA TCTTAGATTT 
AATACGAAAA TAATAAACAT TCAAG 



GGTCTAATGC TTGAGTATCT 240 

ATTCGATTGT CATGCCTCTA 300 

TAGTTCTAGA ATCACCTGTT 360 

GAGGCTTCTC ATTTCCCCCT 420 

AATTATTCCA GGTAGATCCT 480 

GATATTTCTC ATTCAGTTQG 540 

TTGTGGACCr AATlTrGTGT 600 

CTTTTTTACT TTTGTAAAAG 660 

GATCTGCCAT AATOTCTTGG 720 

CTATATTATG AGTCAATCTA 780 

CACTAACAAA CTCTCCCTTG 840 

AATTACCTGG TGAGAAGTTG 900 

AAACAGTTAT GTTGTTCTCC 960 

AGTTGTTTTT TTTATTATAA 1020 

CCAAAATATT GAAATGATGG 1080 

CAAGCAACAC AGTAATGTTG 1140 

TTTGCTAAAA TACACTTTGA 1200 

TATGTAGTAT TAAGTGAAAA 1260 

1285 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81; 
TCTCCAGCCC CAATTTCTAC GCGCACCGGA AGACGGAGGT CCTCTTTCCT TGCCTAACGC 
AGCCATGGCT CGTGGTCCCA AGAAGCATCT GAAGCGGGTG GCAGCTCCAA AGCATTGGAT 
GCTGGATAAA TTGACCGGTG TGTTTGCTCC TCGTCCATCC ACCGGTCCCC ACAAGTTGAG 
AGAGTGTCTC CCCCTCATCA TTTTCCTGAG GAACAGACTT AAGTATGCCC TGACAGGAGA 
TGAAGTAAAG AAGATTTGCA TGCAGCGGTT CATTAAAATC GATGGCAAGG TCCGAACTGA 



60 
120 
180 
240 
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(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOLOGY: linear 



TATAACCTAC CCTGCTGGAT TCATGGATGT CATCAGCATT GACAAGACGG GAGAGAATTT 360 

CCGTCTGATC TATGACACCA AGGGTCOCTT TGCTGTACAT CGTATTACAC CTGAGGAGGC 420 

5 

CAAGTACAAG TTGTGCAAAG TGAGAAAGAT CTTTGTGGGC ACAAAAGGAA TCCCTCATCT 480 

GGTGACTCAT GATGCCCGCA CCATCCGCTA CCCCGATCCC CTCATCAAGG TGAATGATAC 540 

10 CATTCAGATT GATTTAGAGA CTGGCAAGAT TACTGATTTC ATCAAGTTCC ATTCACCCAG 600 

CCAGGTGGTC TCGTCACCTC AGAGGCTCCG CAGACTCCTG CCCAGGCCAG GACTGAGGCA 660 

AGCCTCAAGG CACTTCTAGG ACCTGCCTCT TCTCACCAAG ATGAACTCAC TGCTTTCTTG 720 

GCAGCTACTG CTTTTCCTCT GTGCCACCCA CTTTGGGGAG CCATTAGAAA AGGTGGCCTC 780 

TGTGQGGAAT TCTAGACCCA CAGGCCAGCA GCTAGAATCC CTGGGCCTCC TQGCCCCSGG 840 

20 GGAGCAGAGC CTGCCGTGCA CCGAGAGGAA GCCAGCTGCT ACTGCCAGGC TGAGCCGTCG 900 

GGGGACCTCG CTGTCCCCGC CCCCCGAGAG CTCCGGGAGC CCCCAGCAGC CGGGCCTGTC 960 

CGCCCCCCAC AGCCGCCAGA TCCCCGCACC CCAGGGCGCG GTGCTGGTGC AGCGGGAGAA 1020 

25 

GGACCTGCCG AACTACAACT GGAACTCCTT CGGCCTGCGC TTCGGCAAGC GGGAQGCGGC 1080 

AiCCAGGGAAC CACGGCAGAA GCGCTGGGCG GGGCTGAGGG CGCAGGTGCG GGGCAGTGAA 1140 

30 CTTCAGACCC CAAAGGAGTC AGAGCATGCG GGGCGGGGGC GGGQGGCGGG GACGTAGGGC 1200 

TAAGGGAGGG GGCGCTGGAG CTTCCAACCC GAGGCAATAA AAGAAATGTT GCGTAACTCA 1260 

AAAAAAAAAA AAAAAAAANC TCGGGGGGGG 1290 

35 



(xi) SEQUENCE DE^JRIPTION: SEQ ID NO: 82: 

TTTATTGTAT TCTGTAACTA TAGAACTTCT ATTTOATTCT TTTTTGGACT TGCTAAGTTG 60 

TCTTTOATGG TTTTWAGTTC CATGCTGAAG TTTTCAGTAT TGACTTATCC CCTTGAACAT 120 

GAGTTGTTTT ATAGACTCTR ATGATTCAAA AATCTTACAT CTTTTGGTAG TCTCTTTCAT 180 

55 TTGTYCACTG TTTCTGTTGA TTCTWACTCA TGGTATTTTA ATTCTTCGTT WTTTTTTTTC 240 

TGTTWAGAWA CATTCTTTGA AAAATAATTT GGAGGAATAT TTGATTCTTA TGAACAAGGC 300 

ATTACTCACC AGAGAAGATT TTTTTGTTYT ACCARGTGCC TARGAATGCT AACAGTCTGG 360 

60 
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GAMCACATAG A^ECACCAGGT GATGAGACAA TCCTGGGART CCTGTTTTAC TTTGGSCCAT 
CriTTTCTCCC AACCCTGTCG GAATARTCAT YCATATCCTA RCTGCAGGCT ARAAGGTQCT 
TTATCAGAGC CCAACTTCGA GGGCTCTGGG CTTTAGCTAC TGTCACCXrCA TCATAACTQA 
GCTTCATGGA TTGATTCTCT TTTTATCTTT CAGATTTTCT TTTAAAAATC TTTGTTTTTT 
TTTTTCTTCC GAAAGA1TCC CCCAACATTA CCATTCCCCA CCTTCCX3TTG AATTTTTTTG 
GCTCTCATTT TGAATTTTTC AAGA 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCrTH: 2024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»JESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 83: 
CTGCAGGAAT TCGGCACAGC TGCGCTQGAG GCTTCATCTT TGCCGCCGCT GCCGTCGCCT 
TCCTGGGATT GGAGTCTCGA GCTTTCTTCG TTCGTTCGYC GGCGGGTTCG CGCCCTTCTC 
GCGCCTCGGG GCTGCGAGGC TGGGGAAGGG GTTGGAGGGG GCTGTTGATC GCCGCGTTTA 
AGTTGCGCTC GGGGCGGCCA TGTCGGCCGG CGAGGTCGAG CGCCTAGTGT CGGAGCTGAG 
CGGCGGGACC GGAGGGGATG AGGAGGAAGA GTGGCTCTAT GGCGATGAAA ATGAAGTTGA 
AAGGCCAGAA GAAGAAAATG CCAGTGCTAA TCCTCCATCT GGAATTGAAG ATGAAACTGC 
TGAAAATGGT GTACCAAAAC CGAAAGTGAC TGAGACCGAA GATGATAGTG ATAGTGACAG 
CGATGATGAT GAAGATGATG TTCATGTCAC TATAGGAGAC ATTAAAACGG GAGCACCACA 
GTATGGGAGT TATGGTACAG CACCTGTAAA TCTTAACATC AAGACAGGGG GAAGAGOTTA 
TGGAACTACA GGGACAAAAG TCAAAGGAGT AGACCTTGAT GCACCTGGAA GCATTAATGG 
AGTTCCACTC TTAGAGGTAG ATTTGGATTC TTTTGAAGAT AAACCATGGC GTAAACCTGG 
TGCTGATCTT TCTGATTATT TTAATTATGG GTTTAATGAA GATACCTGGA AAGCTTACTG 
TGAAAAACAA AAGAGGATAC GAATGGGACT TGAAGTTATA CCAGTAACCT CTACTACAAA 
TAAAATTACG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 
TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 
GAGATTACCT GGGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 
CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAGGTCCTT TCTGAAAGAT CTGCTACTGA 
AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACrCA 
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CCTTCCACCT CXTTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 1140 

TCCACCACCX5 GGTTTTCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 1200 

AGAAAGTC3GA CATTCCTCTG GTTATGATAG TCGTTCTGCA CGTGCATTTC CATATGC3CAA 1260 

TGTTGCCTTT CCCCATCTTC CTGGTTCTGC TCCTTCGTGG CCTAGTCTTG TGGACACrAG 1320 

CAAGCAGTGG GACTATTATG CXIAGAAGAGA GAAAGACCGA GATAGAGAGA GAGACAGAGA 1380 

CAGAGAGCGA GACCGTGATC GGGACAGAGA AAGAGAACGC ACCAGAGAGA GAGAGAQGGA 1440 

GCGTGATCAC AGTCCTACAC CAAGTGTTTT CAACAGCGAT GAAGAACX3AT ACAGATACAG 1500 

GGAATATGCA GAAAGAGGTT ATGAGCGTCA CAGAGCAAGT CGAGAAAAAG AAGAACGACA 1560 

TAGAGAAAGA CGACACAQGG AGAAAGAGGA AACCAGACAT AAGTCTTCTC GAAGTAATAG 1620 

TAGACGTCGC CATGAAAGTG AAGAAGGAGA TACTCACAGG AGACACAAAC ACAAAAAATC 1680 

TAAAAGAAGC AAAGAAGGAA AAGAAGCGGG CAGTGAGCCT GCXXXTTGAAC AGGAGAGCAC 1740 

CGAAGCTACA CCTGCAGAAT AGGCATGGTT TTGGCCTTTT GTGTATATTA GTACCAGAAG 1800 

TAGATACTAT AAATCTTGTT ATTTTTCTGG ATAATGTTTA AGAAAITTAC CTTAAATCTT 1.860 

GTTCTGTTTG TTAGTATGAA AAGTTAACTT TTTTTCCAAA ATAAAAGAGT GAATTTTTCA 1920 

TGTTAAGTTA AAAATCTTTG TCTTGTACTA TTTCAAAAAT AAAAAGACAG CAATGACTTT 1980 

ATATCTAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGGC QGCC 2024 
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(2) INFORMATION- FOR SEQ -ID NO: ^4: " - - - - , _ . . . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 beise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CGCGCCMATA GCCGGACGGG GATCTGAGCT GGCAGGATGA ATGTGGGGGT GGCACACAGC 60 

GAAGTAAACC CCAACACCCG AGTGATGAAT AGCCGAGGCA TCTGGCTGGC CTACATCATC 120 

^TTGGTAGGAT TGCTGCATAT GGTTCtACTC AGCATOCCCT TCTTCAGCAT TCCTGTTGTC 180 

TGGACCCTGA CCAACGTCAT CCATAACCTG GCTACGTATG TCTTCCTTCA TACGGTGAAA 240 

GGGACACCCT TTGAGACTCC TGACCAAGGA AAGGCTCGGC TACTGACACA CTGGGAGCAA 300 

ATGGACTATG GGCTCCAGTT TACCTCTTCC CGCAAGTTCC TCAGCATCTC TCCTATTGTG 360 

CTCTATCTCC TGGCCAGCTT CTATACCAAG TATGATGCTG CGCACTTCCT CATCAACACA 420 
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GCCTCATTGC TAAGTCTACT GCTGCCGAAG TTGCCCCAGT TCCATGGGGT TCGTCTCTTT 480 

GGCATCAACA AATACTGAGG GATGGGTTTT GGGACAGCTC CATGGGCATG GGGAAGGCAC 540 

TGAAACAGAG GACTATAAAA CATCCTTCTC TTATTCTCCA TACTGTCTTC TACACCTTTA 600 

AAC3CCTGAGA ACTATACAAC CTTTCCCAGA CTCCXIAAGAA GAGAAGAGAT TCGCAAATGG 660 

GGCTCCTGGG CCXIAGTCCTG CTAGTGGCAA GTTTCTTTGA ATCAGGAAGG CAGGTGAGGT 720 

AAGGGCCAAA TCACTCTCCT CCATAGCAGG AAGCCATTTG GGCAGCTCCT TTGGTGATTA 780 

CATCTTTCCA TATCTTITAC ACTTACCACC TTCCAGCTCT GTTTTGCTGT GTATTTTTCT 840 

15 TACAATAATT TTTTTCAGCT ATAGCTGCAG TTTAATCAGG ATGGGTAGAG AGCTGTCCTC " 900 

ATAAQGCTGG GGGTGGGAAG ATGGAATACT G 931 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
<D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

CGGGGCCGGC GGGGTCTTCA GQGTACCGGG CTGGTTACAG CAGCTCTACC CCTCACGACG 60 

CAAACATGGC AGCGCAGAAG GACCAGCAGA AAGATGCCGA GGCGGAAGGG CTGAGCGGCA 120 

35 

CGACCCTGCT GCCGAAGCTG ATTCCCTCCG GTGCAGGCCG C»SAGTGGC^_.C^ 180 . 

GCGCGACCAT CCGGCCCTGG AGCACCTTCG TGGACCAGCA GCGCTTCTCA CGGCCCCGCA 240 

40 ACCTGGGAGA GCTCTGCCAG CGCCTCGTAC GCAACGTGGA GTACTACCAG AGCAACTATG 300 

TGTTCGTGTT CCTGGGCCTC ATCCTGTACT GTGTGGTGAC GTCCCCTATG TTGCTGGTGG 360 

CTCTGGCTGT CTTTTTCGGC GCCTGTTACA TTCTCTATCT GCGCACCTTG GAGTCCAAGC 420 

45 

TTGTGCTCTT TGGCCGAGAG GTGAGCCCAG CGCATCAGTA TGCTCTGGCT GGAGGCATCT 480 

CCTTCCCCTT CTTCTGGCTG GCTGGTGCGG GCTCGGCCGT CTTCTGGGTG CTGQGAGCCA 540 

50 CCCTGGTGGT CATOGGCTCC GACGCTGCCT TCCACCAGAT TGAGGCTGTG GACGGGGAGG 600 

AGCTGCAGAT GGAACCCGTG TGAGGTGTCT TCTGGGACCT GCCGGCCTCC CGGGCCAGCT 660 

GCCCCACCCC TGCCCATGCC TGTCCTGCAC GGCTCTGCTG CTCGGGCCCA CAGOGCCGTC 720 

55 

CCATCACAAG CCCGGGGAGG GATCCCGCCT TTGAAAATAA AGCTGTTATG GGTGTCATTC 780 

AGGAAAAAAA AAAAAAAAGG GGGGCCCCTC TAGGGGTCAA AGTTA 825 

60 
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(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

CATGTAAAAG GATGAAATGT GACTTCTGGT GTTTTTTTAT TTCTATGGAG GGACTTTCTG 60 

15 GGGACGGTTT CTGGCTCTCA GGCTCTGAGA AC5CTGCAGTT TATGAGTGGC TCTGTGTGTC ' 120 

CTGCCACCTA CTGGAGAAGC CATAAGCTGC AGCTTTAGGA AAAGGGAACC CGGGGCAGAG 180 

TGTGGGGAAG TGGGATGC3CA GCATGGCAGG GCTTTGGAAA ATGAGAGGTC AGAGTKTKTC 240 

20 

CAGGAAGGGT GTAAGGAGAG GATGGATCCT GATACATGGA TTCAGGATCA TTAGGGTCCT 300 

GTCTGGGACA CTGGCCTTCC TGCTTACCTG CTCTTTCCTT CCTCCTTGGT CGGAGGAGGG 360 

25 GCTGGCTCAC TGCTCTGGCT TCATTTTCCA GAGCTGCCTG CTGCAGTCAC ACTTAGGTCA 420 

TCTTCTCTCA CTTTTCTCCT TTTGCCGATT AGTGGACGTG ACAGAGATGT GAATGGGGCA 480 

GGGATGTCCT TTGATGGCAT CAAGACTTTA GCTTCTGGTG CGCTGTGTCC CAGCTCTGAT 540 

30 

TTCAGTTGCA GCCGTGATGG AMAGTTNGCA TGGAAGCTGA GACTCTCACT GACAGTGAAA 600 

CCCTCAAATG AACACAATCC CTGCTTTCCT GCCAAGGATC CTTGTAGGGT NCCCCCAGCT 660 

35 TCCCCACTTT TTTTCTGTGT CCTGACAAAG AAACACAGAG TAACTTGATT GCCCTGTGAC 720 

CIGGCCAGTT- GCATTTGGGC -TGGAGGGTTG AGCCCAAOee -A^ 780 

AGGTTGTTGC CCAAAACACT GAAAAAAACT GCCCTGGCCC TQAACCAAAT ACCTTGAACC 840 

40 

CTCGTAAACT CCATACCCTG ACCCCCTTGT 1TTGGATATA CCCAGGTAGA ACAACTCTCT 900 

CTCACTGTCT GTTGTGAGGA TACGCTGTAG CCCACTCATT AAGTACATTC TCCTAATAAA 960 

45 TGCTTTGGAC TGATCACCCT GCCAGTCTTT TGTCTTGGGC AATCTATACT TTTNCTCAGA 1020 

GGTTCCCAAG GCCTACTGAA GGGACTTAAC ATACTCTTAA TGGCTTTCCT CTCTCTTGTT 1080 

TTACCTTATG CCCTCACTTC CTGAGTTAAC CTCCCAAATA CAGGATTCAC CTGTACCCAA 1140 

50 

GCCCTTAGCT TCAAGAATAC AGGATCACCT GTACCCAAGC CCTTAGCTCA AGCTCTGCTT 1200 

TGGAAGAACC CAAACTAAGA CAGTGCTCCT GGTGCCCT 1238 

55 

(2) INFORMATION FOR SEQ ID NO: 87: 
60 (i) SEQUENCE CHARACTERISTICS; 
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(A) LENGTH: 1460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

ATTGCCTTCT GGTCCCTGGT GACACTGGGG TCATCCTTCA TCCCCGGAGA GCATTTCTGG 60 

10 CTGCTCCTCC TGACCCGGGG CCTGGTGGGG GTCGGGGAC5G CCAGTTATTC CACCATCGCG 120 

CCCACTCTCA TTGCCGACCT CTTTGTGGCC GACCAGOGCG ACCGGATGCT CAGCATCTTC ISO 

TACTTTGCCA TTCCGGTGGG CAGTGGTCTG GGCTACATTG CAGGCTCCAA AGTGAAGGAT 240 

15 

ATGGCTGGAG ACTGGCACTG GGCTCTGAGG GTGACACCGG GTCTAGGAGT GGTGGCCGTT 300 

CTGCTGCTGT TCCTGGTAGT GCGGGAGCCG CCAAGGGGAG CCGTGGAGCG CCACTCAGAT 360 

20 TTGCCACCCC TGAACCCCAC CTCGTGGTGG GCAGATCTGA GGGCTCTGGC AAGAAATCCT 420 

AGTTTCGTCC TGTCTTCCCT GGGCTTCACT GCTGTGGCCT TTGTCACGGG CTCCCTGGCT 480 

CTGTGGGCTC CGGCATTCCT GCTGCGTTCC CGCGTGGTCC TTGGGGAGAC CCCACCCTGC 540 

25 

CTTCCCGGAG ACTCCTGCTC TTCCTCTGAC AGTCTCATCT TTGGACTCAT CACCTGCCTG 600 

ACCGGAGTCC TGGGTGTGGG CCTGGGTGTG GAGATCAGCC GCCGGCTCCG CCACTCCAAC 660 

30 CCCCGGGCTG ATCCCCTGGT CTGTGCCACT GGCCTCCTGG GCTCTGCACC CTTCCTCTTC 720 - 

CTGTCCCTTG CCTGCGCCCG TGGTAGCATC GTGGCCACTT ATATTTTCAT CTTCATTGGA 780 

GAGACCCTCC TGTCCATGAA CTGGGCCATC GTGGCCGACA TTCTGCTGTA CGTGGTGATC 840 

35 

CCTACCCGAC GCTCCACCGC CGAGGCCTTC CAGATCGTGC TGTCCCACCT GCTGGGTGAT 900 

GCTGGGAGCC CCTACCTCAT TGGCCTGATC TCTGACCGCC TGCGCCGGAA CTGGCCCCCC 960 

40 TCCTTCTTGT CCGAGTTCCG GGCTCTGCAG TTCTCGCTCA TGCTCTGCGC GTTTGTTGGG 1020 

GCACTGGGCG GCGCACTTCC TGGGCACCGC CATCTTCATT GAGGCCGACC GCCGGCGGGC 1080 

ACAGCTGCAC GTGCAGGGCC TGCTGCACGA AGCAGGGTCC ACAGACGACC GGATTGTGGT 1140 

45 

GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC CGTGGCCAGT GTGCTCATCT GGAGAGGCTG 1200 

CCGCTCACCT ACCTGCACAT CTGCCACAGC TGGCCCTGGG CCCACCCCAC GAAGGGCCTG 1260 

50 GGCCTAAACC CCTTGGCCTG GCCCAGCTTC CAGAGGGACC CTGGGCCGTG TGCCAGCTCC 1320 

CAGACACTAC ATGGGTAGCT CAGGGGAGGA GGTGGGGGTC CAGGAGGGGG ATCCCTCTCC 1380 

AACAGGGGCA GCCCCAAGGG CTCGGTGCTA TTTGTAACGG GATTAAAATT TGTAGCCAGA 1440 

55 

AAAAAAAAAA AAAAAAAAAA 1460 



60 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

CAGGTGCAAA GTGQGAAGTG TGAGTCCTCA GTCTTGGGCT ATTCGGCCAC GTGCCTGCCG 60 

GACATGGGAC GCTGGAGGGT CAGCAGCGTG GAGTCCTGGC CTTTTGCGTC CACGGGTGGG 120 

AAATTGGCCA TTGCCACGGC GGGAACTGGG ACTCAGGCTG CCCCCCGGCC GTTTCTCATC * 180 

CC5TCCACCGG AVTCGTGGGC GCTOGCACTG GCGCTGATGT AGTTTCCTGA CCTCTGACCC 240 

GTATTGTCTC CAGATTAAAG GTACGACATT TGGAGGCCCC AGCGAGAAAC GTCACCGGGA 300 

GAAACGTCAC CGGGCGAGAG CGGKCCCGCT C5TGTGCTCCC CCGGAAGGAC AGCCAGCTTC 360 

TAGGGGGGAG TGCCACCTGA AAAAAAAATT TCCAGGTCCC CAAAGGGTGA CCGTCTTCCG 420 

GAGACAGCGG ATCGACTACC ATGTGGGTGC CCACAAAAAT TYCACCTYTG AGTCCTCAAC 480 

TGCTGACCCC GGGGTCAGTT CCAGAGAGAA GGACTCCCTC CTGCTTGGAA GAGACCTCAC 540 

ACCGTCATCA CGATGCCAAC GGCTCTGAAG GTGGATGGCA TTCCTGCGTG GATTCATCAC 600 

TCCCGCATCA AAAAGGCCAA CRGAGCCCAA CTAGAAACAT GGGTCCCCAG GGCTGGGTCA 660 

GGCCCCTTAA 7ACTGCACCT AAGTTGGGTG AAGCCATTAG ATTAATTCTT TTTCTTAATT 720 

TTGTAAAACA ATGCATAGCT TCTGTCAACT TATGTATCTT AAGACTCAAT ATAACCCCCT 780 

TGCTATAACT„ GAGGGAATCA ATGATTTGAT TCCCGAAAAA CAGAAGTGGG GAATGTAGTG - 84^ 

TCCAACCTGG TTTTTACTAA CCCTGTTrTT AGACTYTCCC TTTCCTTTAA TCACTCAGCC 900 

TTGTTTCCAC CTGAATTGAC TCTCCCTTAG CTAAGAGCGC CAGATGGACT CCATCTTGGC 960 

TCTTTCNACT GGCAGCCGCT TCCTYCAAGG ACTTAACTTG TGCAAGCTGA CTCCCAGCAC 1020 

ATCCAAGAAT GCAATTAACT GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA 1080 

ATTCGTCCAA TTGATTACAC CCMAAAGCCC CGCGTCTATC ACCTTGTAAT AATCTTAAAG 1140 

CCCCTGCACC TGGAACTATT AACGTTCCTG TAACCATTTA TCCTTTTAAC TTTTTTGCCT 1200 

ACTTTArrrC TGTAAAATTG TTTTAACTAG ACCCCCCCTC TCCTTTCTAA ACCAAAGTAT 1260 

AAAAGCAAAT CTAGCCCCTT CTTCAGGCCG AGAGAATTTC GAGCGTTAGC CGTCTCTTQG 1320 

CCACCAGCTA AATAAACGGA TTCTTCATGT GTAAAAAAAA AAAAAAAAAA CTCGGAGGGG 1380 

GGGCCCGGTA CCCAA 1395 



60 
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(2) INFORMATION FX>R SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1X86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GGCACGAGCC GGCAAGCCGA GCTAGGGTGA AAACTGGGGG OGCACCAGGA TGTNNGACAG 60 

AAAAGCAGAA GATGAGACTC TGTTCATTCA CTTTTCCTAG GCCCATCCTG TGGTCATCTT 120 

15 

TCCCCCTCCC ATCATACCTC CTCCTTCCTG GAGCCTCTGC CGGCTTGGCT GTAATGGTGG 180 

CACTTACCTG GATATTTCAG TGGGAGGATG AAAGGCGAGA CTCACCCTAC GCGGTGGGAC 240 

20 AGATGGQGAG AGGAAAAAGG CAGAGATGGC CAGGAGAGGG GTGCAGGACA AACCAGAGAG 300 

GTTGGGTCAG GGGAAAAGGG TGGGGAGAAA GAGGGGTGCA GGCCCTGCAG GCCGGTTAGC 360 

CAGCAGCTGC GGCCTCCCCG GGCCCTTGGC ATCCAACTTC GCAGACAGQG TACCAGCCTC 420 

25 

CTGGTGTGTA TCATAGGATT TGTTCACATA GTGTTATGCA TGATCTTCGT AAGGTTAAGA 480 

AGCCGTGGTG GTGCACCATG ACATCCAACC CGTATATATA AAGATAAATA TATATATATA 540 

30 TGTATGTAAA TTATGGCACG AGAAATTATA GCACTGAGGG CCCTGCTGCC CTGCTGGACC 600 

AAGCAAAACT AAGCCmTG GTTTGGGTAT TATGTTTCGT TTTGTTATTT GTTTGTTTTT 660 

GTGGCTTGTC TTATGTCGTG ATAGCACAAG TGCCAGTCGG ATTGCTCTGT ATTACAGAAT 720 

35 

AGTGTTTTTA ATTCATCAAT GTTCTA<OT'A JlTC CAGCACCTCC _TCTTA^ 780 

ATTTTAGGAG GTTGCCCAAT TTTGTTTCTT CAATTTTACT GGTTACTTTT TTGTACAAAT 840 

40 CAATCTCTTT CTCTCTTTCT CTCCTCCCCA CCTCTCACCC TTGCCCTCTC CATCTCCCTC 900 

TCCCGCCCTC CCCTCCTCCC TCTQGCTCCC CGTCTCATTT CTGTCCACTC CATTCTCTCT 960 

CCCTCTGTCC TGCCTCCTGC TGCCCCCTCC CCAGCCCACT TCCCCGAGTT GTGCTTGCCG 1020 

45 

CTCCTTATCT GTTCTAGTTC CGAAGCAGTT TCACTCGAAG TTGTGCAGTC CTGGTTGCAG 1080 

CTTTCCGCAT CTGCCTTCGT TTCGTGTAGA TTGACGCGTT TCTTTGTAAT TTCAGTGTTT 1140 

50 CTGACAAGAT TTAAAAAAAA AAAAAGGAAA AAAAAAAAAA AAAAAA 1186 



55 (2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 
60 <C) STRANDEDNESS : double 
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(D) TOPOUX3Y: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 90: 

AAAACATGCT TTCAGGGCGT CCCCTATGTA TTCGGGGGGC CCACGGACAC TCAGGCTGGA 60 

KATCCGTCCT CACTCCGCTC AAGATGGCCT CAGCAGACAC CAGTTACCCA GCTGAAAGTC 120 

ACAATCCCTC CCAGAAGTCT CCCAACACTA GTGCTGACCA GAGGTGGGGC TCTCAGGCTA 180 

GGAGTTTCAC ACACAATGAC AGGCTGCTGG GGGACATTGC AGGACCCCTT TTCCTYTCCT 240 

CTCCATGCTA GAAGCCAGCC CTAGGMAGCT GCAGTTACTC CCTGTGACTC AGCAGCAGGC 300 

TGATTCAACA CAGCTGCCCA CACAAAGCCA GTGGTAATAC ATCTGTTTAC CTTTCCCTAT 360 

CACCCAGACA CAAGCCCCTT TCCCAGGTCA AACCACAGGC CGATOCATCT CCAGTTTGAC 420 

AGTCAAATCA CTACTTCCAT TGCTACTTTA GATCAGCCAA AGTGGTGACT GCTGCAGTGT 480 

GTGGCTATCC CTACAAGGCC CACCCAAGGG ATGCCCAAAG CCCAACCTTC TCCAGGGCTG 540 

CAGCAGNAGC AACCCCACCA GCCTAAGTCC AGCAGAGGAC CTCCCACCCA ATGTCTTGTT 600 

CTAATTAGAA GGGGAAGTTA GCCACAGAAA ATCAACTTAT CTATAATTAC AAAATTCTCT 660 

OXSACTCACCT TAAAGTPCCT ATTGACATCT ACTGCTTTTA AACCTATTTG AAAACTCTGA 720 

TACTAAAACA AATGACACTC TAAGAAAGTT TGGGAGCCCC ATGCTGAGAA CCATTTCTGT 780 

GCAGTGAGGA TGTTTCCAGA AGCTACTTAC CTACATGTGA ATGTGCCATT TTCTTTCCTT 840 

TTGTAGAGAA AATCCCCTTT ACTTTTTGGA ACAGTAATGG CAGCTTCTAG TACAGCCATT 900 

ACAGTTTCAT ATGAGAAAAA TTAAGAATAA CTATAAAATT GTTAAAATAT CCAATAATGG 960 

ATAATCATGG CCAGAAGATT TAACATACAA" AGTTJITTCTC "AATGTAAAGC TATTCAGCTC 1020' 

TTCCAGGTTG AATGCCCTGT AACCCACCCT GACCTTCCAC ATCATCTTCA AAAAGCAGTT 1080 

TCTCTGTTCC CCATGATTCT CCTATAAGGT AACTCTTTAG TCCTCCATTT AGCACATTTT 1140 

AAATCCTCCA AAGAATAAGT ATCATGTGAT TATTTTAGCT TTACAAAAAA AAAGTTGAAT 1200 

GGCGTTTTAT TTTCATQGCC TATAAGCAGG TACCTTAGTA GGGCAGATAT AGGAAAAACA 1260 

AATTAGAGCA AAACAAATCC TCTACAAATC CAAGGCAGGA AAAGTGGTGG CAGAGTGACT 1320 

CATTCTCCTG TCCCTCCCAT CAGGTCAAAT CAGGAGGCTG CAGTGAATGC CTGTTdTTG 1380 

AATGTGTAGC AGTTGTTCCT GTAACTCTTT AAAACTTGGC TATAGGCTGT TTAQCACAGT 1440 

ACAGATTAAA GATACAGTTA CGTAAACAGC AAAGTAATTT TATAGTGCTT CATCCATTTA 1500 

TCATGCTTTG GTTTGCTAAT TTTTTCACAT ACCTTTTTCT ATCACAGTCT GTTGCTTTTG 1560 

TACACATTTC TCATATTGGG GTTCGACAGG TAAACACAAA CTGCTATTTC AGTAGAAAAA 1620 

GTTATTGTTA TGGAATATTA AACCCAATAA ATTGTATAAA GGGTAAAAAA 7AAAAAAAAA 1680 
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AAAAAAAAAA AAAAAAAAAA AAAAAAATTC CTGCGGGCCG CANGCTTTTT CCCTTTGGGT 1740 

GAGGGGTTAT TTTNGGCTTG GGCACTGGGC CCTTCGTTTT TACAACGTCG TGANGGGGGG 1800 

5 AACCCGGGGG GGGTTTCCCC C 1821 



10 (2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEENESS : double 

(D) TOPOIiOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

20 TGCCCTTTTT CCCACCGATT CGGGGCNTGG TGAAGGTGGG AGATGTGAAC TCCAATTAAG 60 

GGACTGQAGA GAGGTGAAGA ATTTTGCAGG TGGGAGATTT GGATTTGAAT GTGGACTTGT 120 

AAATGACTTG ACCTTGCCAT CTGTGTTCAA GGTCACGGTT TGCTGTC3GGG TTCCTGGGAG 180 

25 

AGCTTACTCA CCCCGGAGTC TTTTCTTTCT CTTGCTCCAA GAAGACCCCT GTTGGTGCTT 240 

TACCACCGCT TGGAGTCTCC CGAGGACACA AACAGGCAGA GAGGGACGTG TAGGGAGAGT 300 

30 TCTTTCCTGT TTTCTGTGCT TTCCTTTTTA CAGGACTCCC GGAAGGCCAC TCATGGCCAT 360 

GCCAGGAGCT TTCTCAGAAA CAGTCATAAA CGATCTCTTG AGTCTCTTTC TTGTCCTCCC 420 

AGCTGAGCTT TCTTATTCCA CCCTTTCTGG TGTCTATAGG AATGCATGAC5 AAGACCCTGG 480 

35 

GACGTTTTTC TGCTCTCTTC TGGCCCTCCA TGGAGCCATG GGCCTCGGCC TOGGCGGCTC 540 



CTCACCCTCA CAATTTATTT CCTCCTCCCG TGCCAGCCCT TCTTTTGTGT CTGAAACCGG 600 

40 TTTTAAAATG TGACTCTCCC AGAGAAGAAG CCGCTGGCTG TATGAAACTT GACGGCGCTT 660 

TTGTAAGGTG CCACCCCCAA ACTTTAAGGT AGCTAAACCA ATTTTTAAAA GATTCAATOG 720 

CTTGTTCATC CTCCAGATGT AGCTATTGAT GTACACTTCG CAACGGAGTG TCTGAAATTG 780 

45 

TGGTQGTCCT QATTTATAGG ATTTCATAAT TAAAATGTCT GCTGAATAAA AAAAAAAAAA 840 

AAAAACTCGA GGGGGGCCCG GT 862 

50 

(2) INFORMATION FOR SEQ ID NO: 92: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 
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(xi) SEQUEI^E DESCRIPTION: SEQ ID NO: 92: 

CTGAGGCGAG TGAAGTGGAC TCTGAGGGCT ACCGCTACCG CCACTGCTGC GGCAGGGGOG 60 

TGGAGGGCAG AGGGCCX3CGG AGGCCGCAGT TGCAAACATG GCTCAGAGCA GAGACGGCGG 120 

AAACCCGTTC GCCGAGCCCA GCGAGCTTGA CAACXTCCTTT CAGGACXXAG CTGTGATCCA 180 

GCACXX5ACCC AGCCGGCAGT ATGCCACGCT TGACGTCTAC AACCCTTTTG AGACCCGGGA 240 

GCCACCACCA GCCTATGAGC CTCCAGCCCC TGCXXTCATTG CCTCCACCCT CAGCTCCCTC 300 

CTTGCAGCXrC TCGAGAAAGC TCAGCCCCAC AGAACCTAAG AACTATGGCT CATACAGCAC 360 

15 TCAGGCCTCA GCTGCAGCAG CCACAGCTGA GCTGCTGAAG AAACAGGAGG AGCTCAACCXS 420 

GAAGGCAGAG GAGTTGGACC GAAGGAGCGA GAGCTGCAGC ATGCTGCXrCT GGGRGGCACA 480 

GCTACTCGAC AGAACAATTG GCXTCCCTCTA CCTTCTITTT GTCCAGTTCA GCCCTGCTIT 540 

20 

TTCCAGGACA TCTCCATGGA GATCCCCX:AA GAAITTCAGA AGACTGTATC CACCATGTAC 600 

TACCTCTGGA TGTGCAGCAC GSTGGNTCTT CTCCTGAAYT TCMTCOSCTG CXTTOGCCAGT 660 

25 TCTGTGTGGA AACCAACAAT GGCGAGGCTT TGGGTT 696 



30 (2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NGTH: 1886 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

40 CAGGCCACTG ACGCTTCTTT GCGAGGGATG CAGGAGGTCC TACAGAGAAA GGOGCTTCTT 60 

GCATKTCAGA GGGCCCACAG CCTGTCACCC ACAGATCACC AAGCAGCTTT CTACCTGGCT 120 

CTGCAGCTTG CCATCTCCAG ACAGATCCCA GAGGCTCTGG GGTATGTCCG CCAAGCTCTT 180 

45 

CAGCTTCAAG GTGACGATGC CAACTCCCTG CACCTCCTTG CCCTCCTGCT GTCAGCACAG 240 

AAGCATTACC ATGACGCTCT GAACATCATC GACATGGCCC TGAGTGAATA CCCAGAAAAT 300 

50 TTCATACTAC TGTTTTCCAA AGTGAAGTTG CAGTCACTCT GCCGAGGCCC GGACGARGCA 360 

CTGCTGACTT GTAAGCACAT GCTGCAGATA TGGAAATCCT GCTACAACCT CACCAACCCC 420 

AGTGATTCTG GACGTGGGAG CAGCCTCTTA GATAGAACCA TTGCTGACAG ACGACAGCTT 480 

55 

AATACAATTA CTTTGCCAGA CTTCAGCGAT CCCGAGACAG GCTCCGTCCA TGCCACATCG 540 

GTAGCAGCCT CAAGAGTGGA GCAGGCACTG TCGGAAGTGG CTTCGTCTCT GCAGAGCATG 600 

60 CCCCTAAGCA GGGCCCGCTG CACCCCTGGA TGACGCTGGC ACAGATCTGG CTCCATGCAG 660 
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(2) INFORMATION FOR SEQ ID NO: 94: 



CTGAAGTCTA TATCGGCATC GGGAAGCCTG CAGAAGCCAC AGCCTC3TACC CAAGAAGCTG 720 

CCAACCTCTT CCCAATGTCC CACAATGTCC TCTACATGCG CGGCCAGATT GCTGAGCTCC 780 

5 

GGQGAAGCAT GGACGAGGCG CGGCGGTCGT ATGAAGAGGC CTTAGCCANT CAGCCCCACC 840 

CACGTGAAGA GCATGCAGCG ACTTGGCCXrr GATCCTTCAC CAGVTAGGCC GYTACAGTYT 900 

10 GGCGGAGAAG ATCCTCCGGG ACGCX3GTGCA GGTGAACTCG ACAGCCCACG AGGTCTGQAA 960 

CGGGCTGGGC GAGC3TCCTCC AAGCTCAGGG CAACX5ATCCG GCGGCTACGG AGTGCTTCCT 1020 

GACAGCCTTG GAGCTGGAGG CX^VGCAGCCC CGCCGTOCCC TTCACCATCA TCCCCCX3CGT 1080 

15 

GCrCTGAGCA GGCGCCTGCC AGCCTCACXTT GCCGCTCAGC CTNCAGAGGC CXTTOCCGGGC 1140 

ACCAGGGCTT GTGCXyVTCGC CCCAAGGGGA TGAATCTGCC GCACTGAGGC CAGGGACGAG 1200 

20 TGTTCAGTGG GCCACAGTGA ACCAACCAAA CCAACCCCGA ATCATCGCTC TCGCTATCTG 1260 

CXTTTTCTCTT GTTTTTTTTG CCAGCCCAAT GGTAGTTTCT GAACCTATTG ACATTGTTCA 1320 

AAATGGATCA TGTGCCATAT TrTGTTAGTT GACATCTGAG TTTTCAGTAA AATGATTATG 1380 

25 

GAATTAATCA GCAAATGTAG AAGAATATAT TCAAAGTTAA AATTCAGTGG CAGCACAGAT 1440 

TATTTTTATC AGAGCTGTAA AGAAAACAAC TGTCCTTTTC TCCCCACCAC CCCTCCTGCC 1500 

30 CCACTTTGGC CCAGAAACCA AATGTGAACT TCCTGTCTCC CACCTCAGCA CTAGTCCATG 1560 

CCAGGACACC AGCTGACAAT TTCOTGGTTT TACTGTCAAT AATTGTACCA TGTGATCAAT 1620 

TACTGTCCTC ACTTAGAACA AAGCCTGAGT CCGAGAATAT TTATATTTTA CCAATATATG 1680 

35 

CCTGTTACAA GAGAAGGAAA TATGAGTTAT TrAAGTTTAA CTTTTTTATG TGAATTCAGA 1740 

GTTTATTTAT CGAGGGAAAT ATGTACAAAG AAGCTTCAAA TQGAATATTT ACCGACATTC 1800 

40 CTTATACATG ACAGACACTT GGCTACATGG GAAGATGATG TTAATAATAA AATGATTTTT 1860 

AAATGGAAAA AAAAAAAAAA AAAAAN 1886 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1774 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

55 (xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

CTCAGCTACC GTATACAGTA GGACATAACC CCATTTCACA TGCACTACAC TGAGACTTGC 60 
CTCCTCTCCC CCCACATTGA AGATGTTCTT TTTTCATAAC TATATACTAT TCCATTGCAT 120 

60 



UJOCiD: <WO ^983944aA2_l_> 



wo 98/39448 



PCT/US98/04493 



315 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



GAATATTCTG TAATTTATTT AATCCCCTAT GGATTGATAA TTAGGTTCAT TATAGATAGA 180 

AGTGTAATTA ACATTCCTGT ACATGTATTT TGCTACTTGT GTGGGTATTT CTGTAGGATG 240 

AATAACTAGA AATTTATTGG ATCAGGTTTC ACATTTGCAG TTTTGAAAAC TACTACCAAA 300 

AAGATTTCAC CAATTTACAA CTCCATCATT AGTAAGAATG CXrTGTTTGCC TATAGTCTGC 360 

CAACCXTTGAA TCCTTAAAAA TTTTTGCCAA TCTGGTAGGC AAAATTTCTT TCTTTTCTTT 420 

GAATATTAAT GAGGAGGAAC ATCTTTTCAT GTTTCTTGGC CATTTGCATT TCCTATTATC 480 

AATTGCTTTT GCCCATTTTC CTTTTTTTAA TTATGAAAGT CTAATGACTA CCTTCTCATT 540 

GTATAAAAAA CACAGTTCTT TGAATAGAGA GACCXTTTTTC TCCAATGCTA CCAATCACAT ' 600 

TCCACTTACC ACAGTTTAAC ATACATTCCTC TAGTCACCTT TCCGTACGAA TATACATACA 660 

CATAAAAACA CmTTACAT AAATAGGATC TCATATTCTG TAGCTTTTTA AAATTTTCGT 720 

CTCAAAAAAA GATAACAGGT CTTTAAATTT CTTTAATGGT TGAATATGAT TAAATACTAT 780 

GAAAATGCCA TTATTTATTC CCTTAATTTT TTTCCTCTCG CTATTACATT GCCAAAGTAA 840 

ACATCCTATT CAGATGTCTT TGTGCATGTG TGTGAATATT TCTTTAGTCT GGAGTCCAGT 900 

AAGGTGGATT TTTGGATCAA AGGGTTTGTT CTCTCTCCAC CTTCAGTCTT CXTCAAAGGCX: 960 

TTCATAACTG TATTTTCACC AAGTGTATGG AGAATGTTCA TTTCCCCATA TAACCATACC 1020 

TACACTTGAT AGTTTTTATC TGTTGGGCGA AAAAGAACCT TTTCTTATTT TGCATTTCCC 1080 

TGATTATAAA AAAAAATGGT GAGATTQGGG TTATTTTCAT GTTTATrGGC CATTTATAGT 1140 

TTACTGTGGA TTGTTTGTAT CCCTTACCTG CTTTCTATTG GGTTATGTGT GGATATATTG 1200 

TTTTTATTTG TTGAGeATeT- eCTTeCCCAT OTICTGGTAA- OVCAACCTTT ATTTATTTGT" ' " 1260 

GGGGAACCTA TTCCCTGTGG CTTAGGTGAG CATGTGACCA GGCCTGGCCT CCTGAGTCCC 1320 

ACAGCTTCCT AGCCACAGTG ATAAAAGAAT GGGTATATAA CTTAAGCCAG GCTAAGGAAA 1380 

GCCCTTAACA GAACTTCTGC TGGAACTACT GGAAAGAAGG CTTTATGGAG ATCCCAGGAA 1440 

CCAAGGACCA TGTAAGCCTG AATTTGTGCC ATGTGGAGAG AGTCTGTCTG AGGAGAAACT 1500 

CGGATGCTAG CAGAAATGGA AAGAGAACTA AGTTCTGATG TCATTTTTCT GGAGGCCCTA 1560 

GATCCAGCTG TGCCTAAAGC CTGCCCTACT CCGGACTTTA AAGTTTTGTG AGCCAATAAA 1620 

GTCCCTTTCT TGTTTAAGAT AATTGAATTG AGTTTCTGTT CTGATTAATA TAGGTTATTT 1680 

GTATTTTCTT ATTGATTTGr AGAAAACCTT TGTAATTTTA AATTCTAGAC TTTATCCACT 1740 

ATATAAGTTA ATAAAATTAG CATGGCCTTC CATG 1774 



60 (2) INFORMATION FOR SEQ ID NO: 95: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LET^GTH: 2503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEnbffiSS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GGCACGAGCG AAGGCAAGGG GGCACCAGCT CAGGACTGCA TCTGCCTOCC ATTTCCCTTC 60 

CACTCCTCCT TTCTGGAGTC TGACATTAGA AAGCCAGCGA GAAGGAAGAT TCAAACAACC 120 

AACCCTGATT TCCTGCTTCT CCTTTTCATG AGTGTTCCTG TGGTCTCTGC ACCTCCTTTC 180 

TGTCCCCCGG CAGAGGGCAG TAGAGATGGC CGGCCCAAGG CCTCRGTGGC GCGACCAGCT 240 

GCTGTTCATG AGCATCATAG TCCTCGTGAT TGTGGTCATC TGCCTGATGT TATACGCTCT 300 

TCTCTGGGAG GCTGGCAACC TCACTGACCT GCCCAACCTG AGAATCGGCT TCTATAACTT 360 

CTGCCTGTGG AATGAGGACA CCAGCACCCT ACAGTGTCAC CAGTTCCCTG AGCTC3GAAGC 420 

CCTGGGGGTG CCTCGGGTTG GCCTGGGCCT GGCCAGGCTT GGCGTGTACG GGTCCCTGGT 480 

CCTCACCCTC TTTGCCCCCC AGCCTCTCCT CCTAGCCCAG TGCAACAKTG ATGAGAGAGC 540 

GTGGCGSCTG GCAGTGGGCT TCCTGGCTGT KTCCTCTGTG CTGCTGGCAG GCGGCCTGGG 600 

CCTCTTCCTC TCCTATGTGT GGAATGQGTC ARGCTCTCCC TCCCGGGGCC TGGGTTTCTA 660 

GCTCTGGGCA GCGCCCAGSC CTTACTCATC CTCTTGCTTA TAGCCATGGC TGTGTTCCCT 720 

CTGAGGGCTG AGAGGGCTGA GAGCAAGCTT GAGAGCTGCT AAAGGCTTAC GTGATTGCAA 780 

GGGTTCAGTT CCAACCATG G TCAGAGGTGG CACA TCTGCT CAGCCATCTC ATTT TACAGC 8*0 

TAACGCTGAT CTCCAGCTCC AGCGATGGAA CCCACTACAG AGGAGGTGGG GCCCCTGTGT 900 

CAAAGAGGCC GAGGGGCAGC AAGGGCAGMC AGGGCACCTG TGACTTCTTA GTACAAGATT 960 

GTCTGTCCTT CAGGACTTCC AAGGCTCCCA AAGACTCCCT AAACCATGCA GCTCATTGTC 1020 

ACACCAATTC CTGCTTTAAT TAATGGATCT GAGCAAATCT TCCTCTAGCT TCAGGAGGGT 1080 

GGGGAGGGAG TGATTGCTGT CATGGGGCCA GACTTCCAGG CTGATTTGCC AAATGCCAAA 1140 

ATGAAACCTA GCAAAGAACT TACGGCAACA AACGAGGACA TTAAAAGAGC GAGCACCTCA 1200 

GTGTCTCTGG GGACATGGTT AAGGAGCTTC CACTCAGCCC ACCATAGTGA GTGGGCOGCC 1260 

ATAAGCCATC ACTQGAACTC CAACCCCAGA GGTCCAGGAG TGATCTCTGA GTGACTCAAC 1320 

AAAGACAGGA CACATGGGGT ACAAAGACAA GGCTTGACTG CTTCAAAGCT TCCCTGGACC 1380 

TGAAGCCAGA CAGGGCAGAG GCGTCCGCTG ACAAATCACT CCCATGATGA GACCCTGGAG 1440 

GACTCCAAAT CCTCGCTGTG AACAGGACTG GACGGTTGCG CACAAACAAA CGCTGCCACC 1500 

CTCCACTTCC CAACCCAGAA CTTGGAAAGA CATTAGCACA ACTTACGCAT TGGQGAATTG 1560 
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TGTGTATTTT CTAGCACTTG TGTATTGGAA AACCTGTATG GCAGTGATTT ATTCATATAT 1620 

TCCTGTCCAA AGCCACACTG AAAACAGAGG CAGAGACATG TACTCTGGTG TGATCTCTTG 1680 

TCCTCAGTGT CTCTOCTGGG CTCCTGTCCC TCTTGCTTTA TAGCTAGCTG CCCGGGGACC 1740 

AAGGTACAGG TGAAAGCAAG GTAGCAGCTT GCGGGAGGAG GCCTGTCTGG CTTACCAGTC 1800 

TATACACTGT GGCCTCAACC TCCCAGACAG GGCAGAGAAC TGTGGGCAOC TCGTTTOCTT 1860 

TCTAGGCTGG CTGGAGAGGT GGGAGCTCAT TGATAGACTC ATGATGGAAA CTATTTTTGA 1920 

AACAGGCTTC CTCCTTCAGG AGAGATCATG CGGACTAAAC TGTAGCAATT CCAGTGCACC 1980 

TGGCAGTGAT CCTTTTCTTT GCAAAGTACT GTCTCTTTGG TTCCAC?rAAG TTGGACX::ACC 2040 

ACATGACATY ATTTTCCCTG GAACCTGGTC ACTGACTAAC ACAGACAATT GGGACTCCAG 2100 

AGCCTCAAGA GCCAGGAGAG GGCACAGTAC ATACAGAGGG AGTCAAATGG GATCICATTT 2160 

TGAGTCCTGC CTTCCGCACA CTCAGAACGG CANCCCCAAG GCCCGGAGTC TCCAGGGCTT 2220 

CTGGCCTGAG GTGAATCTGC CAOGCCCAAG AAGGCACAAA GGTAGQAGCA CAGAGAGCCC 2280 

CATTCCCACA GGCGGKCGGC CCAGCAGCAC CAGTGGAAGC TCAGCTGTCC TCCAGCTGCT 2340 

CTCGGCAGAC AGTTCAGTGC ACAGTTTATG CCCTAGCTGA AAAAGATCTC CCGGACGTAT 2400 

TTCAGCACAT CCTCTTCCTC CTCCTCCTCA GGGCTCCTGC TACAGGCAGA GCTGGAACCC 2460 

CCCGGCCTCT GGGAAGGGCT GAGGCCTGGA GYCAGTGCCT GTC 2503 



35 



40 



45 



50 



55 



60 



(2 ) INFORMATION-FOR SEQ ID NO: - 96 : - - - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 

CTGGAAAGCC GAGGGTAGCC GAGCGGGGCG GGCGCTCTGG 

CCGTCCGCTC CGCCAGAAGC ACCGAGCAGC CGAGCCGGGG 

TGAGGCCCGA GTGAGGCGCG GCGGCTATAG CCGACCCGCG 

ATCGCGAGCG CACGACMAGC GGCCCCTGGA GGAGGAGGCG 

ACGGTTTCGA TCGGGCCCCA GGTGCTGGTC GGGGCCGGAR 

GGGGCGGGCC TRAGGGCGGC GGTTTYCCGA AMQGARCGGR 

ACCAGCCGCC GCAACCCAAA GCCCCGGGCT TYCTGCARCC 



96: 

AGCGGCGGGT GCTCGGGCTG 60 

CCCGCCGCCC TCCTCCTCCA 120 

GCGCCTTCCC CCCGCGTCCT 180 

GAGGAGGAGG AGCATGTCGG 240 

CCGGGGCCTG GGCCGCGGAG 300 

GCCTGCTGAG CGGRCGCGGC 360 

AMCGCCGCTG CGCCARCCCA 420 
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GGACGACCCC GCCX3CCAGGG GCCCAGTGCX; AGGTCCCCGC CAGCCCXXAG CGGCCTTCCX: 480 

GGCCCGGGGC GCTCCCAGAG CAAACGAGGC CXXTGAGAGC TCCACCTAGT TCACAGGATA 540 

AAATCCCACA GCAGAACTCG GAGTCAGCAA TGGCTAAGCC CXAGGTGGTT GTAGCTCCTG 600 

TATTAATGTC TAAGCTGTCT GTGAATGCCX: CTGAATTTTA CCCTTCAGGT TATTCTTCCA 660 

GTTACACAGA ATCCTATGAG GATGGTTGTG AGGATTATCC TACTCTATCA GAATATGTTC 720 

AGGATTTTTT GAATCATCTT ACAGAGCAGC CTGGCAGTTT TGAAACTGAA ATTGAACAGT 780 

TTGCAGAGAC CCTGAATGGT TGTGTTACAA CAGATGATGC TTTGCAAGAA CTTGTGQAAC 840 

15 TCATCTATCA ACAGGOCACA TCTATCCCAA ATTTCTCTTA TATGGGAGCT CGCCTGTGTA 900 

ATTACCTGTC CCATCATCTG ACAATTAGCC CACAGACTGG CAACTTCCGC CAATTGCTAC 960 

TTCAAAGATG TCGGACTGAA TATGAAGTTA AAGATCAAGC TGCAAAAGGG GATGAAGTTA 1020 

20 

CTCGAAAACG ATTTCATGCA TTTCTACTCT TTCTGGGAGA ACTTTATCTT AACCTGGAGA 1080 

TCAAGGGAAC AAATGGACAG GTTACAAGAG CAGATATTCT TCAGGTTGGT CTTCGAGAAT 1140 

25 TGCTGAATGC CCTGTTTTCT AATCCTATGG ATGACAATTT AATTTGTGCA GTAAAATTGT 1200 

TAAAGTTGAC AGGATCAGTT TTGGAAGATG CTTGGAAGGA AAAAGGAAAG ATGGATATGG 1260 

AAGAAATTAT TCAGAGAATT GAAAACGTTG TCCTAGATGC AAACTGCAGT AGAGATGTAA 1320 

30 

AACAGATGCT CTTGAAGCTT GTAGAACTCC GGTCAAGTAA CTGGGGCAGA GTCCATGCAA 1380 

CTTCAACATA TAGAGAAGCA ACACCAGAAA ATGATCCTAA CTACTTTATG AATGAACCAA 1440 

35 CATTTTATAC ATCTGATGGT GTTCXnTTCA CTGCAGCTGA TCCAGATTAC CAAGAGAAAT 1500 

ACCAAGAATT ACTTGAAAGA GAGGACTTTT TTCCAGATTA TGAAGAAAAT GGAACAGATT 1560 

TATCCGGGGC TGGTGATCCA TACTTGGATG ATATTGATGA TGAGATGGAC CCAGAGATAG 1620 

40 

AAGAAGCTTA TGAAAAGTTT TGTTTGGAAT CAGAGCGTAA GCGAAAACAG TAAAGTTAAA 1680 

TTTCAGCATA TCAGTTTTAT AAAGCAGTTT AGGTATGGTG ATTTAGCAGA ACACAAGAGA 1740 

45 GCAAGAAAAT GTGTCACATC TATACCAAAT TRAGGATGTT GAGTTATGTT ACTAATGTAT 1800 

GCAACTTTAA TTTTGTTTAA CACTATCTGC CAAAATAAAC TTTATTCCCT ATAACTTAAA 1860 

ATGTGTATAT ATATATAATA GTTTATTATG TACAGTTAAT TCTACTGTTT TGGCTGCAAT 1920 

50 

AAAATCGATT TTGAAATAAA TGAAATGTTG AAAATTTTGC TAGTTGGTTA GATGCTTATC 1980 

CTTTAAATTC TACTTTTCTT GAGGGGAAAA AGTCTTCGTC TGGAAATACA TATTACTGCA 2040 

55 AAAATGTAGC ATCCTTTTTT AGGTAGGAGT ATTATAGCTT YCAITTTAGT TKGACATTTA 2100 

GTGTCCCAAT GAATTGAATT TCAAATATGA ATCATAATCT TGAAAATCTT TAGCACTAAA 2160 

GTCTTGGGAA TATATCAACA ACTGATTTAC ATATGCAGAT GCTATTTGNA TACCAAGGGC 2220 

60 
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TITTTAAATG TCATGGGGGG GAAAAACCCA 
GACTTCACTG GAAGATTTAT TCCAATTCTA 
5 AACTGRCTAA CTTCATTACC TTAAAGCCTA 
TCTCACITTT ATTTTC3TAGC AKGGGTTGCA 
TATTTGTCAT TCAAGTTTTC ATCTGCTTTA 

10 

AATACTTTTA CTATAATGTG C?rACCACCTC 
TCAAATCTTT TTCCAGCTAA CTAAAAACTG 
15 GTAAATAGTT CTGTTAATAA CCCACTGTTT 
AGTTAGCTTT CTCACTTTTC TGCTTGTTTG 
ATAAAGCTTA AAAAAAAAAA AAAAAAAAAA 

20 



ACTTGGTGGA ACTCCCAGCT AAACAACCAA 2280 

GGAATTGTTC TTTTTTATTT TrATTTTTTC 2340 

GAACATTATT CTGCTTTATT TATATGGCTT 2400 

TCGACTTTTT TACTAGAGAA TTTTACTAGA 2460 

TAATTGATAC ACCTTGAGGG TCACTTTTCT 2520 

AGCCCTAATA AATAATATTT TTACCTAATG 2580 

TGTACAAAAG GATTGCTTGT AAATATGCAT 2640 

TACATTTGGT ACATCTGTGT CTGCTAATAC 2700 

TTCAGTCTGA ATTAAAATTA GACTTTGAAA 2760 

AAAAACTCGA G 2801 



25 



(2) INTORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 
30 (D) TOPOLOGY: linear 



35 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

ATGGAGCCAA AGACAATCAC TGATGCTTTG GCTTCTAGTA TAATTAAGAG TGTGCTGCCT 60 

AATTTTCTTC CATACAATGT CATGCTCTAC AGTGATGCTC CAGTGAGTGA ACTGTCCCTC 120 

GAGCTGCTTC TGCTTCAGGT TiSTCTTGCCA GCATTACTCG AACAGGGACA CACGAGGCAG 180 

40 TGGCTGAAGG GGCTGGTGCG AGCGTGGACT GTGACCGCCG GATACTTGCT GGATCTTCAT 240 

TCTTATTTAT TGGGAGACCA GGAAGAAAAT GAAAACAGTG CAAATCAACA AGTTAACAAT 300 

AATCAGCATG CTCGAAATAA CAACGCTATT CCTGTGGTGG GAGAAGGCCT TCATGCAGCC 360 

45 

CACCAAGCCA TACTCCAGCA GGGAGGGCCT GTTGGYTTTC AGCYTTACCG CCGACCTTTA 420 

AATTTTCCAC TCAGGATATT TCTGTTGATT GTCTTCATGT GTATAACATT ACTGATTCCC 480 

50 AGCCTCATCT GCCTTACTTT ACCAGTATTT GCTGGCCGTT GGTrAATGTC GTTTTGGACG 540 

GGGACTGCCA AAATCCATGA GCTCTACACA GCTGCTTGTG GTCTCTATGT TTGCTGGCTA 600 

ACCATAAGGG CTGTGACGGT GATGGTGGCA TGGATGCCTC AGGGACGCAG AGTGATCTTC 660 

CAGAAGGTTA AAGAGTGGTC TCTCATGATC ATGAAGACTT TGATAGTTGC GGTGCTGTTG 720 

GCTGGAGTTG TCCCTCTCCT TCTGGGGCTC CTGTTTGAGC TGGTCATTGT GGCTCCCCTG 780 

60 AGGGTTCCCT TGGATCAGAC TCCTCTTTTT TATCCATGGC AGGACTGGGC ACTTCGAGTC 840 
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CTGCATGCCA AAATCATTGC AGCTATAACA TTGATGGGTC CTCAGTGGTG GTTGAAAACT 900 

GTAATTGAAC AGGTTTACGC AAATGGCATC CGGAACATTG ACCTTCACTA TATTGrTCGT 960 

5 

AAACTGGCAG CTCCCGTGAT CTCTGTGCTG TTGCTTTCCC TCTGrTGTACC TTATGTCATA 1020 

GCTTCTGGTG TTGTTCCTTT ACTAGGTGTT ACTGCC3GAAA TGCAAAACTT AGTCCATCGG 1080 

10 CX5GATTTATC CATTTTTACT GATGGTCGTG GTATTGATGG CAATTTTGTC CTTCCAAGTC 1140 

CGCXAGTTTA AGCGCCTTTA TGAACATATT AAAAATGACA AGTACCTTC3T GGGTCAACGA 1200 

CTCGTGAACT ACGAACGGAA ATCTGGCAAA GAAGGCICAT CTCCACCACC TCCACAGTCA 1260 

15 

TCCCAAGAAT AAACTAGTTG TCTCAACAAC TTGACCTTCC CCTTTACATG TCCTTTTTTC 1320 

TGGACTTCTC TCTTTGGAGA TTTTTCCCAG TGATCTCTCA GCGTTGTTTT TAAGTTAAAT 1380 

20 GTATTTGACT TGTGTTCTCA GCATICAGAG AGCAGCGGTG TAAGATTCTG CTGTTCTCCC 1440 

TGGATCTTCT GACATTACTG CTGTCTGAGA TTTGTATATG TGTAAATACA AGTTCCTTGA 1500 

TACCCTAAAA CCTTGGATTA AACAGAATGT GCATTGTACA TCTTTAAACA AAATGTATAT 1560 

25 

TAATTTATTA AATCTAGTTG TCACTTTAAA AAAAAAAAAA AAAAAACTCG AGGGGGGCCX: 1620 

GOTACCCAAA T 1631 

30 

(2) INFORMATION FOR SEQ ID NO: 98: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 
CB)"TypEl "nucleic" acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CCGAGCTGGG CGAGAAGTAG GQGAGGGCAC GAGCCGCCGC GGTOGCGGTT GCTATCGCTT 60 
45 CGCAGAACCT ACTCAGGCAG CCAGCTGAGA AGAGTTGAGG GAAAGTGCTG CTGCTGGGTC 120 
TGCAGACGCG ATGGATAACG TGCAGCCGAA AATAAAACAT CGCCCCTTCT GCTTCAGTGT 180 
GAAAGGCCAC GTGAAGATGC TGCGGCTGGA TATTATCAAC TCACTGGTAA CAACAGTATT 240 

50 

CATGCTCATC GTATCTGTGT TGGCACTGAT ACCAGAAACC ACAACATTGA CAGTTGGTOG 300 
AGGGGTGTTT GCACTTGTGA CAGCAGTATG CTGTCTTGCG GACGGGGCCC TTATTTACCG 360 
55 GAAGCTTCTG TTCAATCCCA GCGGTCCTTA CCAGAAAAAG CCTGTGCATG AAAAAAAAGA 420 
AGTTTTGTAA TTTTATATTA CTTTTTAGTT TGATACTAAG TATTAAACAT ATTTCTGTAT 480 
TCTTCCAAAA AAAAAAAAAA AAAA 504 

60 
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(2) INFORMATION FOR SEQ ID NO; 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dbxible 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

15 ^^^^^^^^'^^^ AGGGAGCCCT CTCCGTTGGG TGACTCTTGT GTC5CCCTTTA GACAGGCTCG 60 

CCTGCCGGTT CCACAGGGTA CAGTTAGGAC TTGAGTCTTT CTTTTTCTGT TTTGAGTTGG 120 

TGAGTGAGTG ATAGGGTAAC ATGGGCCTTC AGGATGACCC CTTGGAACTC TGCCGAGTTC 180 

20 CTTAAATCTC AGCTGGGATC CTGGACCTGG GAGGCCCCTG TGAGGGCCAG CTCTCGAAAA 240 

ACCTGGGAGT TGATGCCGGA GCTGTGGAAG AACTCTGCTC GAGGGCAGGG TGCCCTOGAA 300 

CACTGGTAGT TCTGGGGCTG GGAGGGAGAG GGGCTCCGGC TTTCTCTGAA ATGAACACTG 360 

CTCTTCAGCA GTTCAAGTAC TTGTTCTCAA AACATTTTCT AATTCATTGG TAGGTTTTCA 420 

TAAGCATTGT TTCTTTAAGG CATGGAAAGG GAAGAATGCT CAAGCAAGTC ATGTTTGTTr 480 

30 TCAGTGGGAT GGGCCCGCGT TCTCACTGCT GGGGGCTTCC CCTTCATCTO GCACCTTTCT 540 

GCAGGGGCCA CCAGGCAGAC TCTTCCCACC TTCTCCCACT GAAGCACCAA GGGGCTIGGA 600 

ACCGTAATTT GQCTAATCAG AGGCATTTTT TTTGTCCTAG TATCTTTCAC ACTTCTCCAA 660 

CCGTCTTATT TTTTTAAAAG TTCTGTTGCT TGTATTAACA CGAAACTAGA GAGAAATAGT 720 

TTCTGAAGCC AGTTTATTGT GAAGATCCCC A7VGGGGAGGT TCGGTAGAGA AAAATAGTAA 780 

40 GCTGGTTTAG AAACTGACGA GGGCAAACAG CCAGGACGCA TTGGAGAGGA ATTTCCCAAA 840 

GATCTACCCT GAGATAACGC CTGTCCAGTG TCTTCACCAC GTGAATAACC AGCGCTCCAA 900 

AGTGTmrC TQCTTTGAAA AAAAAAATTC CACAAGCTTT TAAAGGTGCA TTTAAGAATC 960 

45 

CATGTGACTT TAGAATGGAA CTGCCGGCCC TGGCAACTGT CACGTGTGCT AGAAGGTTCG 1020 

ATGCCTCTGG AATGCATGTG ATACTCATCT CCATTTTGTT TCCITCATTC CATTTTTGTT 1080 

50 CTTTTAGCAG ATCTGTCCCT GTGGGTGGTG TCTAAGAAGT CGGACACCTT GGTTTTTGTG 1140 

TTAGATTGAG CTGGGCAGCT GCAATCAGCT TCTTTATATG CAAATTAGGC ACGACCCATC 1200 

TGTGGnCCT GGTTGGTGGC TAATGAAGTG AGGGGAGGGA GGGATGTCAC CCCAAAAGTA 1260 

GGCCCTCCCA TTGGCTTTGG CCAGGCCAGA CACTTCACAT CGTTTACATC GTTCTGTCTA 1320 

ATTTTAAAGT TTATGTGTAT AAAGCGAAGC TGTTTCTGTG AAACTGTATA TTTTCTAAAT 1380 

60 AAATATATTG CTACTTGAAA AAAAAAAAAA AAAAAA 14X6 



SDOaD: <WO ^983944aA^I_> 



wo 98/39448 



322 



PCTAJS98/04493 



5 (2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LEMGTO: 2847 base pairs 

(B) TYPE: nucleic acid 
10 <C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

15 GGCTAGGACA ATTTTGGTGC TTTACCTATC TCTGCAAAGA CTGQAGAATT TGGCATACCA 60 

TTAATTACAA CCACCAATCA TATCCAACAA AAGTACCCTA AAAGAAGGAC CAGTGGCCAC 120 

TCTCGAAAAA ATTTAAGTAT CAGAAGATTA AAAAGATTTT AGGATTTGGA AGCTTGTATT 180 

20 

GTCTTTCCCC AATAATCATT GTTTGATCTC CAAATAGTAG CCTTATATTA GCAATRGACA 240 

GATCATTGGT TCTCCATATC TGATCATATG TTACTACTTT GGAATCAGTA TTTGGGCAAA 300 

25 TTCAAGCATT TATGCAGTGG ATATAAATGG AAATATAAAA ATATTTGCCA ACCTGTCTCA 360 

GTAACTTATC ATATCTCTGT GNATCCTCAA GGAAAGCACT TTTGCTTTTA CTTAGAAAGC 420 

GTTTCAGATT TGCTTTATAG ACTCCTGCTG TCTTCAGTAC CTGATAAAAC TTTAACCAGG 480 

30 

GAAGCATTAA ACACAGTGCA GCAGCTTTTG CCCAGGCTTC TAAGTTCCTG CCGGCAGCAT 540 

TTATCAATGT AAGAACTAGG ATGCTTCCTG CAGTGGCACT ACCTTCCCCT AGAGCTGGAG 600 

35 CATGCTGCTT GGCCTTAAGC CCCAGCATGA TGAGGCTTCC CTCCTGCCAG GTCAGTAAAA 660 
"GTTAGAGAGC' TCAGAA1TXSG~GTX^^ ' 720' 

TAAAGAGAAG TCACCAAGGG AGGCAGGTAA TGAATGTTTC CAGAATCAGT CKGATACTCA 780 

40 

TAGCAATTTC TGGCTATCTT TCAAATGTTG AATTTCTGGA TGCTGAGAGG GACTTTGATT 840 

TGATATCATT AAATCCAGGA CAGTCCCAAG AAGTGCTTGG AGTCTCGGCT CTGACAGCCC 900 

45 AAGAAGGGAA ATAACTTGTA TTAAGGAACA ACTATGAGCC AGGCCCTGAG CTGTCTCTTA 960 

GATAATAAAA CAGATGGGGA GTGGAAGAGT CATTTGCTTC AAGTTATACA GCTAGGAAAT 1020 

ACTCAAGCCA AATCTTGAAC GCAGCTCCCC CTAATTCTGT GGACAGGCAC TTTGTACCAC 1080 

50 

ACACCATGGT CCACCTAAAA ACAGAAGGAT AAAAAGACTT CAGGTTTTCC CACTGTGTGC 1140 

TGACCATCCC AATTTATGAA TCTTCTTCAA AATGACATTT CACAGTTATA GTTAGGGCTC 1200 

55 AGAAATGGCA TTGAGGTAGC CTTATTTCTC CCCTTTAGCA GATGCTTTAA GTACACATTG 1260 

CTGACTTG7VG CCCACCCCCA GGAGTTAGGA GAACATTTCC TTTTTCATGC CATCTTCCAT 1320 

AAATAAGGTG TTTCTTGGCC TTCAAAGATA TAGAACTTTG CAGCAGTAGT AAAAGTGAAG 1380 

60 
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GGTGTTCTGC TCTCTACTCA ACTTTATTTG AAAATGTCTG CAGCTTCACT CCTGTAGAAA 1440 

AGGAAATCTT CATATTTTAG TAAACTTAGC CXXrCAGTGTA CTCTGTGAGG ATGTOGCAAT 1500 

TCAAAGTCCA GTGAATCTGG CTCTCTTACT GATTCCTCGT TTTAGTGTC3T GTGTCGGGGG 1560 

AGTGTCTACX: TATATATAAA GGACAAGTGT GATATGTGTG TATATGTATA TACATACATA 1620 

CATGTCCACA CACACACACA CAATATTTGA GAGCTAAGGA AAACTCAAAG CAGCCCCTTC 1680 

ATTATCITGC GTACTACTTC AAAGATTTCT GTCAGCCCTA ATTACAAGTC TCACXATATA 1740 

GTTGGGGCTT AGC5TACTTGC TTACAGGAAG AGCAATTCCC TAGCAAAGGT CATTAGCTCC 1800 

TAAGGCACTG AGTCAAAGTG ACAGCXTCTGA AGGAAATTCC ACTCCAGCCC TCCTCCAGGA 1860 

TGTCTAATAA GATGGGAAAC TTGGATGCCC AGCCATTTTG GTGACCTCAG AGTCTAACTA 1920 

CTCCAGTTAG ACCTAAGGGC ACAAATGCAG AATTCATGAC CTTGTAGTTG TCGCAGGGTC 1980 

TAGGAAGTCC TCTCTCCCCA AGTAGAAAAT ATTCTCTTGC CATTCCTGAA ATTCCACATT 2040 

CATATAATGG CTCTGCAATA CATGCTTCTC AATAAGAAAA TTAACTGCAT GTTTACTCTG 2100 

TGCTGATCAC ATCAGATTTT TATGTTTAAA AAAATCTCAT TATGGNTTGA GTCCAGCCCA 2160 

GCTCTAAGAG AAAAAGAAGG CCCATATGGG AGACTTCAGT CTCATTATTA TTGCCTTTAT 2220 

CCAGCAGTGC TTATRAAGCC CCCTACCCTG TCCCATTCCA GAAACCATAA GACTCAGGCA 2280 

GTTCTTGATT CTGGAGGCCT GCCTGGTAAG ATAAGATAGT ATAATTTCGA ACTCAGAACA 2340 

TACCAGAAAC AGCAGAACGA GGGCCAGAGC AGAAAAATGA AT^TAAGTGG AGACACTTAT 2400 

GGATACATTG GTGCAAAAAA AGCCACGGGS CCCATACTGG GCTTGATATG ACTTTGAGGG 2460 

GACAGCAGAT-l^ATACTTAA-TGAGGGTTAA- ACCTGACC^ "TCTTTOTACA 1STOAME3GCC 2S20 ' 

ACACTGCATG AATGGGGAGA ACCAATGAAT CCATTGTCCT CTGCCTATTT TCCTGTGCAC 2580 

AGTCACATTC CCTCCTTAGG AATCTTCCCC TTCCACCCTT TACATTAAAC AAGGGAACAC 2640 

TGAATCTTTC AAGGGAATTA CACGTTTGGG TTAATGTTTC AGTATATCAT TTTCATACTG 2700 

TAAATTATTT TGTAAGAGAG ATTTACTGCT ATCCXAGGAT GTTCGGACTT GGTGCCCCTC 2760 

TGCATTTGGA AATCAATAAA CTATTACTGG AAATGCCAAA AAAAAAAAAA AAAAAAAAAN 2820 

NAAAAAACTC GAGGGGGGCC CGTACCC 2847 



55 



(2) INFORMATION FOR SEQ ID NO: 101: 



(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
60 (D) TOPOliOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

GAGATTGGTG GAGGAGAGTA AATAATCTAG AGGCAAGAGT TCAGTGAGGG CCAAGGGC3GA 60 

CCCCCAGAAA AAGGTATGGA GCTAACTCAT CTCTTTTACA AGGGGTGGCC ATGACTTACT 120 

GTTGCAAAGT ACTCAGTGTA TATTTAATGT TGATTGTTGA ATTTTAGTTA CGAGAGGGAA 180 

GAACAATTTT ACTTCTGTCC TTATTTCACT TGCTGAAAAG CTGTGGGACA AAATGTATQG 240 

AATAGACAAG GCCACTTTCT TTOTGATTTC TGCTTTTCAT GCATATTATT TTATTTACCC 300 

ATAATTTCCA AGAGGTTTGG CGTTCCGCTC TCCTC3CTTTT TTCTTTCATC CACCCCTTTC 360 

CTTTTTTTGG AAGGGGGTTA TATATGAGAG TTCATTGAAG AAGTCCAGTG AGGCTGAAGT 420 

AAAGGGGCAA GATAGGGCAG TTAACTAAAG AGCACTTTAT TTCTTTGAAG CCTTTCTAAG 480 

AAAGAAATGG GGGTOCGAGT GGCTTGAATC TCCCATGATG TTGGAGGGCA CTTAGTGGOG 540 

TTGAAGTATG ACATAATATT TCCCATTGGG GAAAGGAGAA TTTCTCTTAG AGGGTGOCAA 600 

AATGCCTTTG CCCAGTGTCC CTATTTTAGG CATCTTTTCC TTCCTTATTC CTTCCAGTCA 660 

GGGTGTGTCC TATACAAAAC TTCCCATCAG ITCTCCTCAA TATTCCCCAT TTGTAAATGA 720 

TCACTTCTCT TTTCTAAACC CTTTTCCTGT TCAGATCCAT ACAGGATTTG CAAGGGTAGG 780 

ATCATACATG CAAATGCCCC TTGTTCATCT GTGTCTTCTG CAAACTAGTC TCATGAAGAA 840 

TTCTGGCGTG CAGCAGGGTA GCTGAAGTTT GGGTCTGGGA CTGGAGATTG GCCATTAGGC 900 

NTCNCTGAGA TTCCAGCTCC CTTCCACCAA GCCCAGTCTT GCTACGTGGC ACAGGGCAAA 960 

CCTGACTCCC TTTGGGCCTC AGTTT CCCCT CC CCT TCATG AAATGAAAAG AATACTACTT 1020 

TaTCTTGTTG GTCTAGCATT GCTGGACACA AAGTGTAGTC ATTATTGTTG TATTGGGTGA 1080 

TGTGTGCAAA ACTGCAGAAG CTCACTGCCT ATAAGAGGAA ATAAGAGAGA AAGTGGAGGA 1140 

GAGGGACAAA AGGAGTAATT ATTTGGTATA GATCCACCCA TCCCAACCTT TCTCTCCTCA 1200 

GTCCCTGCTC CTCATGTTTC TGGTTTGGTG AGTCCTTTGT GCCACCACCC ATAATGCTTT 1260 

GCATTGCTGC ATCCTGGGAA GGGGGTATAT GGTCTCACAA GTTGTTGTCA TIGTTTTTTT 1320 

GCATGCTTTC TTAATAAAAA AAAAAAAAAA ATGTTTANAG TTTTATCTTA AAAAAAAAAA 1380 

AAAAAAAAAA ACCC 1394 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOIjOGY': linear 
(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 102: 

5 GGMRCGAGGC GGAGTAAAGG GACTTGAGCG AGCCAGTTGC CGGATTATTC TATTTCXXXTT 60 

CCCTCTCTCC CGCCCCGTAT CTCITTTCAC CCTTCTCCCA CXXTTCGCTCG CGTACCATGG 120 

CX3GAGCGTCG GCGGCCACTC AGTCCCATTC CATCTCCTCG TCGTCCTTCG GAGCCGAGCC 180 

GTCCGCGCCC GGCGGCGGCG GGAGCCCAGG AGCCTGCCCC GCCCTCGGGA CGAAGAGCTG 240 

CAGCTCCTCC TGTGCGGTGC ACGATCTGAT TTTCTGGAGA GATGTGAAGA AGACTGGGTT 300 

15 TGTCTTTGGA CACGCTGATC ATGCTGCTTT CCCTGGCAGC TTTCAGTGTC ATCARTGTGG 360 

GTTTCTTAMC TCATCCTGGC TCTTCTCTCT GTCACCATCA RCTTCAGGAT CTACAAGTCC 420 

GTCATCCAAG CTGTWCAGAA RTCAGAARAA GGCCATCCAW TCCAAAGCCT ACCTGGACGT 480 

20 

AGACATTACT CTGTCCTCAG AAGCTTTCCA TAATTACATG AATGCTGCCA TGGTCCACAT 540 

CAACAGGGCC CTGAAACTCA TTATTCGTCT CITTCTGGTA GAAGATCTGG TTCACTCCTT 600 

25 GAAGCTGGCT GTCTTCATGT GGCTGATGAC CTATGTTGGT GCTGTTTTTA ACGGAATCAC 660 

CCTTCTAATT CTTGCTGAAC TGCTCATTTT CAGTGTCCCX5 ATTGTCTATG AGAAGTACAA 720 

GACCCAGATT GATCACTATG TTGGCATCGC CXZGAGATCAG ACCAAGTCAA TTGTTGAAAA 780 

GATCCCAAGC AAAA 794 



30 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1544 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

TTTGCTTGCT AGTCTGAACC AAAGAGTTGT TTGGGCATTT GCTGTGTTGG CCATTTCTGG 60 

AGCAAGAGGG TCTTCTTCCT CCTTCCCCCA GCCAGCCAGC TGTCCTGGGG CCAGGCTTTC 120 

50 CTGGGTGGAA AGAAGTATAC CTTTCCCTGG GGCCCTAGGA TAGCAAAGTG AGCCATAGTG 180 

GGCCAGGCTG CCCTCCATGC TGGGCCCCAG CCCAGGTCTG CACTCGCCTG GATCACCTTC 240 

TTTGAGCCTT AGCCATCTCC TGTCAGGTAG GAATGAACTT GCCAGCCTTC AGGYTCGTTC 300 

AGCTATGACC ATCTGTGCGG TCAGGGTACA CTCAGCTCTC CTCCCCAACT CCAGCAGCCT 360 

TTAAGAAGTG TCCCTTTGGC GCCCCCTQGA GGCAGAGCAC TGAGCTGGAC CCTGGGTAGA 420 

60 CTCCCACAGG GAGGACGGAG CTGGCCTCAG GAGTGGGACA CCCAGACTTG GCAGQGCCTT 480 
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CAAGAGGCCT GTGTGGGGGC CCXIAGGAATC CTTAGCTGAA GCGGGGAGAC TCACTCTCCA 540 

TCrCAGGAAA TTCTAGCCCT TGCCCTCAGG GAGCCACGGT TGAGGGTGAG GCCCAACACC 600 

TGCGTTAGGG CCCTGGGTGG GCAAGTCTGG GCCCTGQGGT AGGGAGGGAG ACTCAGGCCC 660 

ACACTTGGGT ATTTTCTAAT TTCAGACAAA CACACACTCA GCGCGCACTC ACTGATTCCT 720 

ACACATTGCC AAGATTTCAC ACATGTGACC AGGGGCCACC AAAGTCCXTTG TGACCTTTGT 780 

GACTAGGATC CTAATTTCTC TATTTTCTCC TGGGTGCCTG GGTCTGTGTC ACCTGGGGCA 840 

GTGTGGATAA TGTTTAGTTC TGTGACACTG TTTTTTGGGG GTGGCACCTG GnTCTCCGAT 900 

GCCTGGGCTG GTGTCAGGCC CAGGACTGTA GTGCTGGGAG CAGTAAAGCT CAGCTCTGTG 960 

TAATGAGTGA TGCTATQGCT TGCTCGTGTC TTATGATCCA ATCCTTTTCT ACATCAGCCC 1020 

TTGTTTTGTT TTATGGCTAG TCTTATCTGG CCTGGTTATT TCCTTGCGGG GAGGAGAGGG 1080 

TTTGCTAATC TGCTCCCAGC CCAACCTATT ACCACCCCAC CTCGCTGGGA CXrTACTGCTC 1140 

GGGAGGCAGC AGACAGGGAG CCACCAGCAG TGGCTTCCTG GCCCTGTGCT GGGGGTGGGG 1200 

GGAAGCTGGG GGCACATGTG GCCCTTGCCT TCTGAGCAGC TCCCAGTGCC AGGGCTTTGA 1260 

GACTTTCCCA CATGATAAAA GAAAAGGGAG GTACAGAAGT TCCAATTCCC TTTTTATTTT 1320 

GCTGGTTGGT ATCTGTAAAT GTrTAATAAA TATCTGAGCA TOTATCTATC AACGCCAAGA 1380 

ATTTCAAAGT CTCCTTCAAC AATATGAGGC TTTTAGGATG TTTATATTCC TTCATCCCTC 1440 

TTGTTTCCCA GGTTTTGCAG GGAAAAAAAG TCTGGAATTA TAGATACAGC TrTATTATTAA 1500 

ATTTGTTCTT GCATAAAAAA AAAAAAAAAA AACNOSINGGG, GGGG . 1544 
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(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
ACCCACGCGT CCGNCTTGTC CACCCGGGGG CGTGGGAGTG AGGTACCAGA TTCAGCCCAT 
TTGGCCCCGA CGCCTCTGTT CTCGGAATCC GGGTGCTGCG GATTGAGGTC CCGGTTCCTA 
AGGTGGGTCG CTGTCCACCC GGGGGCGTGG GAGTGAGGTA CCAGATTCAG CCCATTTGGC 
CCCGACGCCT CTGTTCTCGG AATCCGGGTG CTGCGGATTG AGGTCCCGGT TCCTAACGGA 
CTGCAAGATG GAGGAAGGCG GGAACCTAGG AGGCCTGATT AAGATGGTCC ATCTACTGGT 
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CTTGTCAGGT GCCTGGGGCA TGCAAATGTG GGTGACCTTC GTCTCAGGCT TTCCTGCTTT 360 

TCCGAAGCCT TCCCCGACAT ACCTTCGGAC TAGTCCAGAG CAAACTCTTC CCCTTCTACT 420 

TCCACATCTC CATGGGCTGT GCXTTTCATCA ACCTCTGCAT CTTGGCTTCA CAGCATGCTT 480 

GQGCTCAGCT CACATTCTGG GAGGCCAGCX: ACXTTTTACXTT GCTGTTCCTG AGCCTTACGC 540 

TGGCCACTGT CAACGCCXX3C TGGCTGGAAC CCCGCPiCCAC AGCTGCCATG TGGGCCCTGC 600 

AAACCGTGGG AGAAGGAGCG AGGCCTGGGT GGGGAGGTAC CAGGCAGCCA ACAGGTTCCC 660 

GATCCTTAAC GCCAGNTGCG AGAGAAGGAC CXX:AAGTACA GTGCTCTCCG CCAGAATTTC 720 

15 TTCCXXTACC ATGGGCTGTC CTCTCTTTGC AATCTGGGCT GCGTCCTGAG CAATGGGCTC 780 

TGTCTCGCTG GCCTTGCCCT GGAAATAAGG AGCCTCTAGC ATGGGCCCTG CATGCTAATA 840 

/vatgcttctt cagaaaaaaa aaaaaaaaaa a 871 
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(2) INFORMATION FOR SBQ ID NO: 105: 



(i) SEQUEJTCE CHARACTERISTICS: 

(A> LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOIiOGY: linear 



<xi) SEQUENCE DESCRIPTION: SBQ ID NO: 105: 

ggcacgagtt atagcatggc attcatactt ttgttttatt gcctcatgac TTTTTTGAGT 60 

TTAGAACAAA ACAGTGCAAC CGTAGAGCCT TCTTCCCATG AAATTTTGCA TCTGCTCCAA 120 

AACTGCTTTG AGTTACTCAG AACTTCAACC TCCCAATGCA CTGAAGGCAT TCCTTGTCAA 180 

40 AGATACCAGA ATGGGTTACA CATTTAACCT GGCAAACATT GAAGAACTCT TAATGTTTTC 240 

TTTTTAATAA GAATGACGCC CCACTTTGGG GACTAAAATT GTGCTATTGC CGAGAAGCAG 300 

TCTAAAATTT ATTTTTTTAA AAAGAGAAAC TGCCCCATTA TTTTGGTGGG GTTGGTTTTT 360 

AATTTNTAAT NTGAAAAATT TTTTTGGGGT TTTTGGGGCC ATGG 404 



<2) INFORMATION FOR SEQ ID NO: 106: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 beise pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
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arCAGACAGG TGGAGCCGCC GGGGCAGGAG TCTCAAAGAG OiAGGCTCC^ GGAGAGGAAG 60 

GGCTCTRCrGA GAGGAGAGAG GAGAGCGCTG GAGAGGAQAG GCTGGAGAGT CCTTAGCCAG 120 

5 GATGGAGGCT GTTGTGAACT TGTACCAAGA GGTGATGAAG CACGCAGATC CCCGGATCCA 180 

GGGCTACCXrr CTGATGGGGT CCXTCTTGCT AATGACCTCC ATTCTCCTGA CCTACGTGTA 240 

CITCGTTCTC TCACTTGGGC CTCGCATCAT GGCTAATCGG AAGCCCTTCC AGCTCCGTGG 300 

10 

CTTCATGATT GTCTACAACt TCTCACTGGT GGCACTCTCC CTCTACATTG TCTATGAGTT 360 

CCTGATGTCG GGCTGGCTGA GCACCTATAC CTGGCGCTGT GACCCTGTGG ACTATTCCAA 420 

15 CAGCCCTGAG GCACTTAGGA TGGTTCGGGT GGCCTGGCTC TTCCTCTTCT CCAAGTTCAT 480 

TGAGCTGATG GACACAGTGA TCTTTATTCT CCGAAAGAAA GACXXX3CAGG TGACCTTCCT 540 

ACATGTCTTC CATCACTCTG TGCTTCCCTG GAGCTGGTGG TGQGGGGTAA AGATTGCCCC 600 

20 

GGGAGGAATG GGCTCTTTCC ATGCCATGAT AAACTCTTCC GTGCATGTCA TAATCTACCT 660 

GTACTACGGA TTATCTGCCT TTGGCCCTGT GGCACAACCC TACCTTTGGT GGAAAAAGCA 720 

25 CATGACAGCC ATTCAGCTGA TCCAGTTTGT CCTGGTCTCA CTGCACATCT CCCAGTACTA 780 

CTTTATGTCC AGCTGTAACT ACCAGTACCC AGTCATTATT CACCTCATCT GGATGTATGG 840 

CACCATCTTC TTCATGCTGT TCTCCAACTT CTGGTATCAC TCTTATACCA AGGGCAAGCG 900 

30 

GCTGCCCCGT GCACTTCAGC AAAATGGAGC TCCAGGTATT GCXAAGGTCA AGGCCAACTG 960 

AGAAGCATGG CCTAGATAGG CGCCCACCTA AGTGCCTCAG GACTGCACCT TAGGGCAGTG 1020 

35 TCCGTCAGTG CCCTCTCCAC CTACACCTGT GACCAAGGCT TATGTGGTCA GGACTGAGCA 1080 

GGGGACTGGC CCTCCCCTCC CCACAGCTGC TCTACAGGGA CCACGGCTTT GGTTCCTCAC 1140 

CCACTTCCCC CGGGCAGCTC CAGGGATGTG GCCTCATTGC TGTCTGCXrAC TCCAGAGCTG 1200 

40 

GGGGCTAAAA GGGCTGTACA GTTATTTCCC CCTCCCTGCC TTAAAACTTG QGAGAGGAGC 1260 

ACTCAGGGCT GGCCCCACAA AQGGTCTCGT GGCCTTTTTC CTCACACAGA AGAGGTCAGC 1320 

45 AATAATGTCA CTGTGGACXrC AGTCTCACTC CTCCACCCCA CACACTGAAG CAGTAGCTTC 1380 

TGGGCCAAAG GTCAGGGTGG GCGGGGGCCT GGGAATACAG CCTGTGGAGG CTGCTTACTC 1440 

AACTTGTGTC TTAATTAAAA GTGACAGAGG AAACCANAAA AAAAAAAAAA AAAAACTCGA 1500 

50 

GGGGGGCCCG TACCCAAATC GCCGGTATGA TCGTAAACAA TC 1542 



55 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 2327 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 107: 

GGTAGCTCAN TGCAGTGAAA TAGTCTTACT GGAAACAAAG CXXTTTTATCA AGAATAATTA 60 

ACTCTTCCCT TTTCTTTTTG GAGAGGTGCT TTGTTTCTGA TCGGACCATT TCACTGCAGC 120 

AAGCAACACA GTATTCTRAG CAGAAGATCG GGACTTGAGG CCATGTTGCX; GAGGGCCAGT 180 

RACATTATCT GGACTCTGGA GTC3TGAQGAA TATGGACTCC ACTCTTCACT ATATTCACAR 240 

CGATTCAGAC TTGAGCAACA ATAGCAGTTT TAGCXXTGAT GAGGAAAGGA GAACTAAAGT. 300 

ACAAGATGTT GTACXTTCAGG CGTTGTTAGA TCAGTATTTA TCTATGACTG ACCCTTCTCG 360 

TGCACAGACG GTTGACACTG AAATTGCTAA GCACTGTGCA TATAGCCTCC CTGGTGTGGC 420 

CTTGACACTC GGAAGACAGA ATTOGCACTG CX^XSAGAGAG ACGTATGRGA CTYTGGCCTC 480 

AGACATGCAG TQGAAAGTTC GACGGAACTC TAGCATTCTC CATCCACGRG CTTGCAGTTA 540 

TTCTTGGAGA TCAATTGACA GCTGCAGATC TGGTTCCAAT TTTTAATGGA TTTTTAAAAG 600 

ACCTCGATGA AGTCAGGATA GGTGTTCTTA AACACTTGCA TGATTTTCTG AAGCTTCTTC 660 

ATATTGACAA AAGAAGAGAA TATCTTTATC AACTTCAGGA GTTTTTGGTG ACAGATAATA 720 

GTAGAAATTG GCGGTTTCGA GCTGAACTGG CTGAACAGCT GATTTTACTT CTAGAGTTAT 780 

ATAGTCCCAG AGATGTTTAT GACTATTTAC GTCCCATTGC TCTGAATCTG TGTGCAGACA 840 

AAGTTTCTTC TGTTCX3TTGG ATTTCCTACA AGTTGGTCAG CGAGATGGTG AAGAAGCTGC 900 

ACGCGGCAAC ACCACCAACG TTCGGAGTGG ACCTCATCAA TGAGCTTGTG GAGAACTTTG 960 

GCAGATGTCC CAAGTGGTCT GGTCGGCAAG CrTTTGTCTT TGTCTGCCAG ACTGTCATTG 1020 

AGGATGACTG CCTTCCCATG GACCAGTTTG CTGTGCATCT CATGCCGCAT CTGCTAACXTT 1080 

TAGCAAATGA CAGGGTTCCT AACGTGCGAG TGCTGCTTGC AAAGACATTA AGACAAACTC 1140 

TACTAGAAAA AGACTATTTC TTGGCCTCTG CCAGCTGCCA CXIAGGAGGCT GTGGAGCAGA 1200 

CCATCATGGC TCTTCAGATG GACCGTGACA GCX3ATGTCAA GTATTTTGCA AGCATCCACC 1260 

CTGCCAGTAC CAAAATCTCC GAAGATGCCA TGAGCACAGC GTCXTTCAACX: TACTAGAAGG 1320 

CTTGAATCTC GGTGTCTTTC CTGCTTCCAT GAGAGCCGAG GTTCAGTGQG CATTCGCCAC 1380 

GCATGTGACC TGGGATAGCT TTCGGGGGAG GAGAGACCTT CCTCTCCTGC QGACTTCATT 1440 

GCAGGTGCAA GTTGCCTACA CCCAATACCA GGGATTTCAA GAGTCAAGAG AAAGTACAGT 1500 

AAACACTATT ATCTTATCTT GACTTTAAKG KKWAWKMMWW KCTCAGMSRA TTATAmTSW 1560 

CWMMRARGSM WVMAAWSCTK SW3CTCVWCC KSRSTGRMKG MMRCTCTAGA AYTRGYRGAK 1620 

QWYYKSGCT KMWGGAAKKS GGCASGAGCC AGAGACCTGC ATTGCTTTCT CXrTGGTTTTA 1680 
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TTTAACAATC 
AGAGACCTTC 

5 

GCCAGCTGTC 
AGGGGTTTTT 
10 TTAAGCGCAG 
TTTTTCATGC 
TTTTAACTTG 

15 

GACCACTCTT 
TCTCTTGTCC 
20 TTGTTTTCTA 
ACTCGAGGGG 
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GACAAATGAA ATTCTTACAG CCTGAAGGCA GACGTGTGCC CAGATOTGAA 
AGTATCAGCC CTAACTCTTC TCTCCCAGGA AGGACTTGCT GGGCTCTCTG 
CAGCXXAGCC CTGTGTGTGA ATCGTTTGTG ACGTCTGCAA ATGGGAAAGG 
ACATCTCCTA AAGGACCTGA TGCCAACACA AGTAGGATTG ACTTAAACTC 
CATATTGCTG TACACATTTA CAGAATGGTT GCTGAGTGTC TCTCTCTGAT 
TGGTCATGAC CTGAAGGAAA TTTATTAGAC GTATAATOTA TGTCTGGTGT 
ATCATGATCA GCTCTGAGGT GCAACTTCTT CACATACTGT ACATACCTGT 
GGGAGTGCTG CAGTCTTTAA TCATGCTGTT TAAACTGTTG TGGCACAAC3T 
AAATAAAATT TATTAATAAG ATCTATAGAG AGAGATATAT ACACTTTTGA 
GATGTCTACC AATAAATOCA ATTTGTGACC TGTAAAAAAA AAAWAAAAAA 
GGCCCGGTAC CCAAATCGCC GATATGATCT AANCATC 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOCT: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
GGCCGCCGAG GCGCAACAGC CGTTCTGTCA GCTCTGGGTC CAACCGGACT AGCGAANATC 
TTCCTCATCC TCATCATCGT CTTCCTCATC CCGATCTCGG TCCAGGTCCC TCTCCCCCCC 
ACACAAGAGG TGGCGAAGGT CCAGCTGTAG TTCCTCTGGA CGTTCTCGAA GATGCTCTTC 
CTCTTCTTCG TCATCATCTT CCTCTTCGTC TTCCTCATCC TCATCATCCA GTTCTCGAAG 
CCGCTCACGA ATCCCCATCC CCCCGCCGGA GRAAGTGACA GGAGGCGGCG GTACAGCTCT 
TATCGTTCAC ATGACCATTA CCAAAGGCAA AGAGTGCTAC AAAAGGAGCG TGCAATAGAA 
GAAAGAAGGG TGGTCTTCAT TGGAAAGATA CCTGGCCGCA TGACTCGATC AGAGCTGAAA 
CAGAGGTTCT CCGTTTTTGG AGAGATTGAG GAGTGCACCA TCCACTTCCG TGTCCAAGGG 
GACAACTACG GCTTCGTCAC TTATCGCTAT GCTGAGGAGG CATTTGCAGC CATTGAGAGT 
GGCCACAAGC TGCGGCAGGC AGATGAGCAG CCCTTTGATC TCTGCTTTGG GGGCCGAAGG 
SWGTNCTGCA AGAQGAGCTA TTCTGATCTT GACTCCAACC GGGAAGACTT TGACCCAGCA 
CCTGTAAAGA GCAAATTTGA TTCTCTTGAC TTTGACACAT TGTTGAAACA GGCCCAGAAG 
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AACCTCAGGA GGTAACCTTG GGCCCTTCCC TGCTATCCTT TTTCTCCTTT GGAGGTGCCC 780 

AACCTCCTCC ACCCCCTTCC CCTACTCTAG GGGAGAGAGC TGCTAGTCAG ATGACTGTTT 840 

TATAAAGAAA TQGAAAAAAG TGAAATAAAA AATATGTTGA ATCAGATTTT TTAAAAGGGG 900 

TATTTGTTTT TTTATAACAG GTATTGAAAC AAGITAACTT GCATTCCTAT CTAAGATAGG 960 

AGGGGCTGAG GGGATCCCCA GTGTTTGGAA CATAAGTCAC TATGCAGACT AATAAACATC 1020 

AACTAGAGAG NAAAAAAAAA AAAAAAAAAA ATTTAAAAAA CT 1062 
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<2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GAGAGACTCA CACTTCTTTT CCAITATCAC TGACGATGTA GTGGACATAG CAGGQGAAGA 60 

GCACCTACCT GTGITGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 

AQGCTTCCTG CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 180 

AACTGAGAAG TGGGGATTAA ATATGGAGTA TTGTCGTGGC CAGGCTTACA TTGWCTCTAG 240 

TGGATTTTCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TYAAGMKMRA TWKCCCCMAK 300 

YWAWCKGAAC AMAMKCTGSW CYTCCWSYGC SKTRRMKRYC GYKSTATRRC WARWKSAKYM 360 

CCYGKKMTGS RRGTAWYTSK TGCAYKAGGG AACAATTGAG GAAGTTTGTT CTTTTTTCCA 420 

TCGATCACCA CAACTGCTTT TAGAACTTGA CAACGTAATT TCTGTTCTTT TTCAGAACAG 480 

TAAAGAAAGG GGTAAAGAAC TGAAGGAAAT CTGCCATTCT CAGTGGACAG GCAGGCATGA 540 

TGCTTTTGAA ATTTTAGTGG AACTCCTGCA AGCACTTGTT TTATGTTTAG ATGGTATAAA 600 

TAGTGACACA AATATTAGAT GGAATAACTA TATAGCTGGC CGAGCATTTG TACTCTGAGT 660 

GCAGTGTCAG ATTTTGATTT CATTGTTACT ATTGTTGTTC TTAAAAATGT CCTATCTTTT 720 

ACAAGAGCCT TTGGGAAAAA CYYCMAGGGG CAAACCTCTG ATGTCTTCTT TGCKKMMSRT 780 

ARMTTTTGAY ATRMARYACT RMMTKSAYTY AAYGRWGTGA CWSGAWAATA TTRAASTVTA 840 

TACAATKAAT YWTRRYTTSM KRMAGMYAAT CCGAAAYTGT GC3MAAMYAAA CTTGATATTC 900 

AAATGAAACT CCCTGGGAAA TTCCGCAGAG CTCACCAGGG TAACTTGGAA TCTCAGCTAA 960 

CCTCTGAGAG TTACTATAAA GAAACCCTAA GTGTCCCAAC AGTGGAGCAC ATTATTCAGG 1020 

AACTTAA7VGA TATATTCTCA GAACAGCACC TCAAAGCTCT TAAATGCTTA TCTCTGGTAC 1080 
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CCTCAGTCAT GGGACAACTC AAATTCAATA CGTCGGAGGA ACACCATGCT GACATGTATA 1140 

GAAGTGACTT ACCCAATCCT GACACGCTGT CAGCTGAGCT TCATrGTTGG AGAATCAAAT 1200 

5 

GGAAACACAG GGGGAAT^T ATAGAGCTTC CGTCCACCAT CTATGAAGCC CTCCACCTGC 1260 

CTGACATCAA GTTTTTTCCT AATGTTGTATG CATTGCTGAA GGTCCTGTGT ATTCTTCCTG 1320 

10 TGATGAAGGT TGAGAATGAG CGGTATGAAA ATGGACGAAA GCGTCTTAAA GCATATTTGA 1380 

GGAACACTTT GACAGACCAA AGGTCAAGTA ACTTGGCTTT GCTTAACATA AATTTTGATA 1440 

TAAAACACGA CCTGGATTTA ATGGTGGACA CATATATTAA ACTCTATACR AKTAMGTCAG 1500 

15 

MGCTYYCTAC AKAYRAYTCM SWAWNTrGTGG AAARYWSSTA MGMSWC3CWKK TAMMRROMCG 1560 

GMWWIYYYMK RKTYGAYMYW YGCGWMCGAG AAAAAGCCGT AAGGTGTATG TAGACCACTT 1620 

20 AATCACTAAA TATCTTTGCC TATAGGACTC CATTGAATAC ATTAGCCATT GATAATCTAC 1680 

CTGTTTAAAT GGCCCCTGTT TGAACTCTCA AGCTTTGAAG ACCTACXTTOT TCTTCCAGAA 1740 

GAGAACGTTG AAAGTGCCAT GTTTCCTTTT GCGTGATCTC TGTTGATGGC ACTCTGGAAT 1800 

25 

TGTTTCCAGT TTAAKTCATT TTAGACATAG CATTTATTAT CACTGTGGAT CTCTACTTGT 1860 

TGGGTGTTAT GAATTCTTTG AAGAATATAT TTTGAAGAGG TGTGGGAGGA AGGAATACAT 1920 

30 TTTATAAAAT GTTGTAGTGA AGCCCACAAT TGACCTTKGA CTAATAGGAG TTTTAAGTAT 1980 

GTTAAAAATC TATACTQGAC AGTTACAAGA AATTACCX3GA GAAAAGCTTG TGAGCTCACC 2040 

AAACAAGGAT TTCAGTGTAG AOTTTGrCTT TCTTGAACTT AAAGAAACAA ATGACAAAGT 2100 

35 

TTGAATGGAA AAGCXTTGCTG TTGTTCCACA TCTCGTTGCT GTTTACATTC CTTTGTGGAG 2160 

CCTACATCTT CCTAAGCTTT TTAGCAGGTA TATGTTGAAC ACTTCTGTTT CATGGTTGAG 2220 

40 ACAGAATCAG AGGCCATGGA TACTGACAAC TGATTTGTCT GTTTTTTTTC TCTGTCTTTT 2280 

TCCATGACTC TTATATACTG CCTCATCTTG ATTTATAAGC AAAACCTGGA AAACCTACAA 2340 

AATAAGTGTT GTQGTTTATC TAGAAAAATA TGGAAAATAT TGCTGTTATT TTTGGTGAAG 2400 

45 

AAAATCAATT TTGTATAGTT TATTTCAATC TAAATAAAAT GTGAATTTTG TTWWATTAAA 2460 

AATTW3GSAC AAABTBGHGG GGGOTCCAAA CHTWVTCGHG KAAMTTCTCT WAARMATYTK 2520 

50 ATAAACMSCT TCACAATTC 2539 



55 (2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 
60 CO STRANDEENESS : double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 110: 

5 AGCATGAAGC CGATGGCCGT GGTGGCCAGT ACCGTCCTGG GCCTGGTGCA AAACATGCGT 60 

GCGTTTGGCG GGATCCTGGT GGTGGTCTAC TACGTATTTG CCATCATTGG GATCAACTTC 120 

TTTAGAGGCG TCATTGTGGC TCTTCCTGGA AACAGCAGCC TGGCCCCTGC CAATGGCTCG 180 

10 

GCGCCCTGTG GGAGCTTCGA GCAGCTGGAG TACTGGGCCA ACAACTTCGA TGACTTTGCG 240 

GCTGCCCTGG TCACTCTGTG GAACTTGATG GTGGTGAACA ACTGGCAGC5T GTTTCTGGAT 300 

15 GCATATCGGC GCTACTCAGG CCCGTGGTCC AAGATCTATT TTGTATTGTG GTGGCTOGTG 360 

TCGTCTGTCA TCTGGGTCAA CCTGTTTCTG GCCCTGATTC TGGAGAACTT CCTTCACAAG 420 

TQGGACCCCC GCAGCCACCT GCAGCCCCTT GCTGGGACCC CAGAGGCCAC CTACCAGATC 480 

20 

ACTGTGGAGC TCCTGTTCAG GGATATTCTG GAGGAGCCCG GGGAGGATGA GCTCACAGAG 540 

AGGCTGAGCC AGCACCCGCA CCTGTGGCTG TGCAGGTGAC GTCCGGGCTG CCATCCCAGC 600 

25 AGGGGCGGCA GGAGAGAGAG GCTGGCCTAA CACAGGTGCC CATCATGGAA GAGGCGGCCA 660 

TGCTGTGGCC AGCCAGGCAG GAAGAGACCT TTCCTCTGAC GGACCACTAA GCTGGGGACA 720 

GGAACCAAGT CCTTTGCGTG TGGCCCAACA ACCATCTACA GAACAGCTGC TGGTGCTTCA 780 

30 

GGGAGGCGCC GTGCCCTCCG CTTTCTTTTA TAGCTGCTTC AGTGAGAATT CCCTCGTCGA 840 

CTCCACAGGG ACCTTTCAGA CAAAAATGCA AGAAGCAGCG GCCTCCCCTG TCCCCTGCAG 900 

35 CTTCGGTGGT GCCTTTGCTG CCGGCAGCCC TTGGGGACCA CAGGCCTGAC CAGGGCCTGC 960 

"ACAGGTTASC"CGTGAGTCTG TCTCATCTAT TCACAGCreG~GAATGATACT AMACCTCOT^ 1020 

ATTTTAGCCC AGCACCACAG GGTACGTTCC AGTTTTTCTC TCTTTCCATA GCTGTAAGGC 1080 

40 

CCTTTCTGGG AATGGTTCTC ATTCTCCTTA ATCTATTATT GGGTCAGTTT TCCTGCATGT 1140 

CCCCAGCCTC CCATCACTGC CACCCACTCC CCACAGAGAT GCCCTGCTCA TCCGACTGGG 1200 

45 GCTTTGACTC CCACACTGTG TACCCCTCTT GTGTGGACGC CCTGCTGCCA AAACCTTCAG 1260 

CAAACAGCTT TCCAAATGGA AGTTGTCACT GTCAGGCCTT TACAATCAGC AACAGCAAAA 1320 

TCTACATGCT GCTGAGGGTC CTGCCTCATT AAGATGCAAT AAATATGTAA GTACATAAAA 1380 

50 

ACAGCAATAG AAGAAACGTA ATGCTTTATT CTCAAATATG ATGTCTACAT AGAAAAGCCA 1440 

AAATTATTAA GAATAGTAAG AATTCACCCA GCACTTTGGG AGGCCGAGGC GGGTGGATCA 1500 

55 TGAGGTCAGG AGATCGAGAC CATCCTGGCT AACAGGGTGA AACCCCGTCT CTACTAAAAA 1560 

TACAAAAAAT TGGCCGGGCG CAGTGGCGGG CGCCTGTGGT CCCAGCTACT GGGGAGGCTG 1620 

AGGCAGGAGA ATGGCGTGAA CCCGGGAAGC GGAGCTTGCA GTGAGCCGAG ATTGCGCCAC 1680 

60 
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TGCAGTCCGC AGTCCAGCCT GGGCGACAGA GCGAGACTCC GTCTCAAAAA AAAAAAAAAA 1740 
AAAAAAAAAA A 1751 

5 

(2) INFORMATIOTI FOR SBQ ID NO: 111: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 111: 
AATGTTGTGG TGGTAGCATT TGGGTTAATT CTRATTATAG AGTCTCTTGG AGAGCAATGT 60 
20 CCATAAACTA ATCCCAAACA ACATTGTCTT TTTRATGTTG TAGTGAACAG CAGAGAATTT 120 
CAAAGGACCT TGCTAATATC TGTAAGACGG CAGCTACAGC AGGCATCATT GGCTGGGTGT 180 
ATGGGGGAAT ACCAGCTTTT ATTCATGCTA AACAACAATA CATTGAGCAG AGCCAGGCAG 240 

25 

AAATTTATCA TAACCGGTTT GATGCTGTGC AATCTGCACA TCGTGCTGCC ACACGAGGCT 300 
TCATTCGTTA TGGCTGGCGC TGGGGTTGGA GAACTGCAGT GTTTGTGACT ATATTCAACA 360 
30 CAGTGAACAC TAGTCTGAAT GTATACCGAA ATAAAGATGC CTTAAGCCAT TTTGTAATTG 420 
CAGGAGCTGT CACGGGAAGT CTTTTTAGGA TAAACGTAGG CCTGCGTGGC CTGGTGGCTG 480 
GTGGCATAAT TGGAGCCTTG CTGQGCACTC CTGTAGGAGG CCTGCTGATG GCATTTCAGA 540 

35 

- . ACTACTCTGG-TGAGACTGTT-CAGGAAAGAA-AACAGAAGGA^TCGAAAGG^ 600- 

TAAAACTGGA AGAGTGGAAA GGCAGACTAC AAGTTACTGA GCACCTCCCT GAGAAAATTG 660 
40 AAAGTAGTTT ACAGGAAGAT GAACCTGAGA ATGATGCTAA GAAAATTGAA GCACTGCTAA 720 
ACCTTCCTAG AAACCCTTCA GTAATAGATA AACAAGACAA GGACTGAAAG TGCTCTGAAC 780 
TTGAAACTCA CTGGAGAGCT GAAGGGAGCT GCCATGTCCG ATGAATGCCA ACAGACAGGC 840 

45 

CACTCTTTGG TCAGCCTGCT GACAAATTTA AGTGCTGGTA CCTGTGGTGG CAGTGGCTTG 900 
CTCTTGTCTT TTTCTTTTCT TTTTAACTAA GAATGGGGCT GTTGTACTCT CACTTTACTT 960 
50 ATCCTTAAAT TTAAATACAT ACTTATGTTT GTATTAATCT ATCAATATAT GCATACATGA 1020 
ATATATCCAC CCACCTAGAT TTTAAGCAGT AAATAAAACA TTTCGCAAAA GATTAAAGTT 1080 
GAATTTTACA GTTAAAAAAA AAAAAAAAAA AAAAAAA 1117 

55 



60 



(2) INFORMATICaa FOR SBQ ID NO: 112: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEHC3TH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
5 <D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



GGCAGAGGTT TTCTTATATT TTAAGTAAAT TTAAAGTGGC TATCAGAATA TTTATTCTTG 60 

10 

TTTGAGACTA CCAACATAAC TACGTGTTGA AGGTGdTCA CAGAGAATAT ATTGCCTTTA 120 



ATGTGAAATA AOTTTCACCA ATGTTGCTAA CTTTAATAAA GTATAAAATT TGTAGAATAT 180 
15 TCAGTTAAGT AGTTGGTAAC CCTTTTCTAT TTTAGTAAAA CTTAATGCAT GTTTACTTTT 240 



TTTTGAAAGA TGCAGACAAT CTCTTTGAAC ATGAATTGGG GGCTCTCAAT ATGGCTGCAT 300 



TACTACGAAA AGAAGAAAGA GCAAGTCTTC TTAGTAATCT TGGCCCATGT TGTAAGGCGT 360 

20 

TGTGCTTCAG ACGGGATTCT GCAATTCGAA AGCAGCTTGT TAAAAATGAG AAGGGCACCA 420 



TAAAACAAGC TTACACGAGT GCTCCAATGG TAGACAATGA ATTACTTCGA TTGAGTCTTC 480 
25 GGTTATTTAA GCGGAAGACT ACTTGCCATG CTCCAGGACA TGAAAAGACT GAAGATAATA 540 



AACTTTCACA GTCCAGTATC CAACAGGAAC TGTGTGTGTC TTAAGACCGA AGTTACAATA 600 



TGGTATTTTT GGTACTGTCT TCCTTCAGCA GTGCATATTC TTTTGCAAAG ITCTTTQGTT lS60 

30 

TGACAAGCAT TAGTGACAAA GGCAGAAAAG ATTTATCAGC CATGCTAAAA GAGTGAAGAA 720 

TTTTGATGTT TAGAGACACT AGTTTTGGCC AACTTAAGAT TTTACCTTAA TTTTTACATA 780 

35 GTATTTGACA CTCATGCAAA ATAATGTGAA AACATCTAGA TTTAGTAGTT TATTCTGCGC 840 



CTTTTGTTAA" AACTGAAGAT' TTIH3GAAAAT GGTTGTCACT GCTCTTCCAG CCTATGAATA ' 900 



TTTTTGTGAA ATGGAACCAT GGATTTATGT CTGGATCATC CATACAGAAC CAACAATTTT 960 

40 

ATTCAAAAAC AATGTGTTCA TCAAAGTAAT TGCTCACATT GTGCAGTACT ATGTTGTACA 1020 



GACCACGTGA AAGGGAATGC TGGTCTAGCT GGCGTGGTAT GTTTATAGGC GAATTTCAGC 1080 
45 AGAAGGAAGC CAAAATAGTT ITTTCCTTTT GAAAGTTTTT TAAAAATTAT TTCATGGGTC 1140 
TTTTTTITAA TTAATATGTG TGCATTGTTA CAATGTATGT TGGGATGTCT TTTGACCCTA 1200 



AATGCTTTTT TTGTTATCAG AGATTGTGTA CTATTTTTAT TTTTAATAAA TGTATCTTCC 1260 

50 

CTTTTMAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1313 



55 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 113: 

5 

ACAGGGACAG AATACTTTCT TTCCTTCCTT CAAGTACAAG AAGGCTTTCT CTACCATrTG 60 

CGTCTACACT TTATTTTAAA AGCTATCCTT TTCTAGTAGT ATTTTATCAT GGCAATGGCA 120 

10 TGATGACAAC AACAGTCTTT CATTACAGAC TGAAGGGAAG CATGTCCTTA CTTAAAATAG 180 

TTCTGCTACT TTCCCTCCTA TTATAAOGAA ATTTTACAGA TTCTAAAAAT ACCTTAATTT 240 

TTCTTTGATT TTTATTTTAC CAAGTCACAA ATGTCTTTTT GATGTTTTGA GAATTGTTCT 300 

15 

CATAGAATCA CAAATACTGA CATTTCATTA GATGATTATT TTCCTAGAAT CCCCAAAGAG 360 

CAGTC3GCAGT CCATGGCTTG GTTGAAGCTA GAAATTTTCC TGCCCCTGGT GACCTGGTAA 420 

20 GCCTCCTGCT CGGAACCGTG TGAGTGGGTG AGGAAGATGA GAGATGGTCA GATGGAAGAG 480 

AGRAATACAT GAACTGCTCT GGCCTCTCTG GTTCTGTTCT TCGCCCAGAG TTTTTGAAAA 540 

GCAGCGGANA TNGACTGACT TCACATGCTC AGCTTTCTCA GCCTTTTGTT TATTTTGTTG 600 

25 

TCCTTAGATT TCCCTGTTGT AAAAGGGGCA AGAAAAGTAA CTCATCATCT CTAACACACC 660 

ATGGCAGCTT AGCCAGGTAG TCTTAGTGGT GGTGTTTAGG CATAAGATAT GCTGATCATC 720 

30 AGTCTCAGGC CACAGTTTCC TTCACTAATC GTCCAGCTTG AGTGTTCTGT TCTCTTCCTG 780 

CCCATTTCCT TGAACCTCCT GCTCTAGCCT TGGCGGAGGG AGAGTGCTAT TTGCTTTTGT 840 

TCTCCCTCTG TCTTAGGAAA AGCCATCTTT AATATAGTTC TTCACCACTG TTGGGGTTCT 900 

35 

TTTOTGMTT^TTTTr^^ CXTIGGTTOTT. ATrGGATTTr_GTATTTrAAT 5^J0_. 

ACAAATTATT GAATTTTATA AGCTTGTACA CAATATTTAA TTAGTGTGAA AGGAAACAAA 1020 

40 GAATGCAGGA AAAATAATTT AATATCAACC TCAGTTGACA AGGTGCTCAG ATTATTCAAT 1080 

TCGGGATCCT CCTTTTGTTA GGTTTTTGAG ACAACCCTAG ACCTAAACTG TGTCACAGAC 1140 

TTCTGAATGT TTAGGCAGTG CTAGTAATTT CCTCGTAATG ATTCTGTTAT TACTTTCCTA 1200 

45 

TTCTTTATTC CTCTTTCTTC TGAAGATTAA TGAAGTTGAA AATTGAGGTG GATAAATACA 1260 

AAAAGGTAGT GTGATAGTAT AAGTATCTAA GTGCAGATGA AAGTGTGTTA TATACATCCA 1320 

50 TTCAAAATTA TGCAAGTTAG TAATTACTCA GGGTTAACTA AATTACTTTA ATATGCTGTT 1380 

GAAYCTACTC TGTTCCTTGG CTAGAAAAAA TTATAAACAG GACTTTGTAG TTTGGGAAGC 1440 

CAAATTGATA ATATTCTATG TTCTAAAAGT TGGGCTATAC ATAAATTATT AAGAAATATG 1500 

55 

GATTTTTATT CCCAGGATAT GGTGTTCATT TTATGATATT ACGCAGGATG ATGTATTGAG 1560 

TAAAATCAGT TTTGTAAATA TGTAAATATG TCATAAATAA ACAATGCTTT GACTTATTTC 1620 

60 CAAAAAAAAA AAAAAATAAA NTTCGAGGGG GGGC 1654 
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55 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: lineeir 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 

GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180 

arCGMCCrtCA TCTTCCTGCG CCGACCTGCG CGGGGTAAGG GGWAGTTICA GACTGTGAAG 240 

GACGTCGTGC TGGACTGCCT GTTGGACTTC TTACCCGAGG GGGTGAACAA AGAGAAGATC 300 

ACACCACTCA CGCTCAAGGA AGCTTATGTG CAGAAAATGG TTAAAGTGTG CAATGACTCT 360 

GACCGATGGA GTCTTATATC CCTGTCAAAC AACAGTGGCA AAAATGTGGA ACTGAAATTT 420 

GTGGATTCCC TCCGGAGGCA GTTTGAATTC AGTGTAGATT CTTTTCAAAT CAAATTAGAC 480 

TCTCTTCTGC TCTTTTATGA ATGTTCAGAG AACCCAATGA CTGAGACATT TCACCCCACA 540 

ATAATCGGGG AGAGCGTCTA TGGCGATTTC CAGGAAGCCT TTGATCACCT TTGTAACAAG 600 

ATCATTGCCA CCAGGAACCC AGAGGAAATC CGAGGGGGAG GCCTGCTTAA GTACTGCAAC 660 
CrcTTGGTGA GGGGCTTTAG~GCCCGCCTCT"GATGAAATCA AG^ AACCrTATATG " ' 720 

TGTTCCAGGT TTTTCATCGA CTTCTCAGAC ATTGGAGAGC AGCAGAGAAA ACTGGAGTCC 780 

TATTTGCAGA ACCACTTTGT GGGAATTGGA AGACCGCAAG TATGAGTATC TCATGACCCT 840 

TCATGGAGTG GTAAATGAGA GCACAGTGTG CCTGATGGGA CATGAAAGAA GACAGACTTT 900 

AAACCTTATC ACCATGCTGG CTATCCGGGT GTTAGCTGAC CAAAATGTCA TTCCTAATGT 960 

GGCTAATGTC ACTTGCTATT ACCAGCCAGC CCCCTATGTA GCAGATGCCA ACTTTAGCAA 1020 

TTACTACATT GCACAGGTTC AGCCAGTATT CACGTGCCAG CAACAGACCT ACTCCACTTG 1080 

GCTACCCTGC AATTAAGAAT CATTTAAAAA TGTCCTGTGG GGAAGCCATT TCAGACAAGA 1140 

CAGGAGAGAA aaaaaaaaaa aaaaaaaaaa a 1171 



(2) INFORMATION FOR SEQ ID NO: 115: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENCmi: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOIiOGY: linear 

5 

<xi) SEQUENCE DESCRIPTION; SE3Q ID NO: 115: 
GGTCTGCGCC GGAAGTGCAT GAGCTGCCGA TGTGGTGCTT AGTGATTGCG GTTTCGGTOG 60 
10 CTCTCCCGTG TTTCCCGGGC TQGCTATTTG CCTCGCACCA TQGCGCCCAA GGGCAAAGTG 120 
GGCACGAGAG GGAAGAAGCA GATATTTGAA GAGAACAGAG AGACTCTGAA GTrCTACCTG IBO 
CGGATCATAC TGGGGGCCAA TGCCATTTAC TGCCTTGTGA CGTTGGTCTT CTTTTACTCA 240 

15 

TCTGCCTCAT TTTGGGCCTG GTTGGCCCTG GGCTTTAGTC TGGCAGTGTA TGGGGCCAGC 300 
TACCACTCTA TGAGCTCGAT GGCACGAGCA GCGTTCTCTG AGGATGGGGC CCTGATOGAT 360 
20 GGTTCGCATGG ACCTCAACAT GGAGCAGGGC ATGGCAGAGC ACCTTAAGGA TGTGATCCTA 420 
CTGACAGCCA TCGTGCAGGT GCTCAGCTGC TTCTCTCTCT ATGTCTGGTC CTTCTGGCTT 480 
CTGGCTCCAG GCCGGGCCCT TTACCTCCTG TGGGTGAATG TGCTGGGCCC CTGGTTCACT 540 

25 

GCAGACAGTG GCACCCCAGC ACCAGAGC/^ AATGAGAAAC GGCAGCGCOG ACAGGAGCGG 600 
CQGCAGATGA AGCGGTTATA GCCATTGACA TTGTGGCCAC AGGCCACTGG CCCTGGGTGG 660 
30 CTCTGTCAGG GTGCACAGCC CCTCATGCCT GGAGCAATGA GGGTCTAGTC CAGGGGCCAA 720 
AAGCAGTCTG AGGTATTGGG TATACTTATA CTCTATAGGG TCGTTGAATA AATGGCTTAG 780 
7ATGTGAAAA AAAAAT^AAAA AAAAAACTCG AGGGGGGCCC GGTACCCAAT TTCNCCTANA 840 

35 

AT 842 



40 

(2) INFORMATION FOR SEQ ID NO: 116: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1640 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

50 

GGCACGAGGC GGCGGCAGCG GTGGCGGCGG CGCCCCCCGG CGGGAGCCGT TCCCTTTCCC 60 
GTCGGGGAGC GCGGGGYCGG GGCCCAGGGG ACCCCGGGCC ACGGAGAGCG GGAAGAGGAT 120 
55 GGATTGCCCG GCCCTCCCCC CCGGATGGAA GAAGGAGGAA GTGATCCGAA AATCTGGGCT 180 
AAGTGCTGGC AAGAGCGATG TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 240 
GCCTCAGTTG GCAAGGTACC TGGGAAATAC TGTTGATCTC AGCAGTTTTG ACTTCAGAAC 300 

60 
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TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTGCGAA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA ATACAACATT GCCAATTAGA CAAACAGCAT 420 

5 CAATTTTCAA ACAACCGGTA ACCCAAAGTC ACAAATCATC CTAGTAATAA AGTGAAATCA 480 

GACCCACAAC GAAOXSAATGA ACAGCCACGT CAGCTTTTCT GGGAGAAGAG GCTACAAGGA 540 

CTTTAGTGCA TCAGATGTAA CAGAACAAAT TATAAAAACC ATGGAACTAC CCAAAGGTCT 600 

10 

TCAAGGAGTT GGTCCAGTAG CAATGATGAG ACCCTTTTAT CTGCTGTTGC CAGTGCTTTG 660 

CACACAAGCT CTGCX3CCAAT CACAGGGCAA GTCTCCGCTG CTGTGGAAAA GAACCTGCTG 720 

15 TTTGGCTTAA CACATCTCAA CCXTCTCTGCA AAGCTTTTAT TGTCACAGAT GAAGACTCAG 780 

GAAACAGAAG AGCGAGTACA GCAAGTACGC AAGAAATTGG AAGAAGCACT GATGGCAGAC 840 

ATCTTGTCGC GAGCTGCTGA TACAGAAGAG ATGGATATTG AAATGGACAG TCGAGATGAA 900 

20 

GCCTAAGAAT ATGATCAGGT AACTTTCGAC CGACTTTCCC CAAGAGAAAA TTCCTAGGAA 960 

ATTGAACAAA AATGTTTCCA CTGGCTTTTG CCTGTAAGAA AAAAAATGTA CXTCGAGCACA 1020 

25 TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTTGTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGGAAATG 1200 

30 

TAGCACTTAC GTAAAACATT TGTTTCCCCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 

35 CTTTATTGAA AGGGGACACC TGTACATTCT TCCATCGTCA CTGTAAAGAC AAATAAATGA 1380 

- TTATATTeGA <:AGAAAAAAA- AAAAAAAAAW-mSTYGARRR- GSRGCl^RSW - 1440 

CCWMRTWRGS MKTCSTOTKA YTTACATTCA ACTCTGATCC CGGGGCCTTA GGTTTGACAT 1500 

40 

GGGAGGTGGG AGGAAGATAG CGCATATATT TGCAGTATGA ACTATTGCCT CTGQGACGTT 1560 

GTGAGGAATT GTGCTTTCAC CAGAATTTCT AAGGATTTCT GGCTTAAATA TCACCTAGCC 1620 

45 TGTGGTAATT TTTTTTCCCT 1640 

50 (2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 
55 . (C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTIC»T: SEQ ID NO: 117: 

60 TGAATTTAGN AAACACTTTG GAAAACTCAT AACCTCATCA GAAACTGCCT TTAGCCACAC 60 

ISDOCID: <WO_9839448A2J_> 
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TCCTGACCTT CTAGATGAGT AACAAAAAAA TGAAATAAGT TCTTGGAAAT TAAGCCATTT 120 

ATTTTAATTT GCTATTTTTT TCAATGTTCT AGGTATCTTT AAAlTTGrTA TTGTGGAATC 180 

5 

AmTCCTGC CAGATACCTT TATCAAAATT ATTGGCCTCA TGAGAGCTGA AGTAAGTCAG 240 

CTTTTTGGTG AACTTTAC5TG GACTTCTGTG AGATTGTAGT TGTACTTTGT ATCTCTAAAT 300 

10 CTAAAGATAG TTTTTTAAAA CTCCCAAAGA AAATCTGCTC ICCTTTCTGA TCTAAAAACT 360 

CATCTTTGGG GTAAAGAGTT AAGTGTCCAA AGGTTGTCAC AGTTCATGAG CTTCAGAGGGA 420 

GCTAGCXrrGG CACCTQGACT CTGCCCATCC ACAGCTGACA GATTCCAACA GAAGTGTATT 480 

15 

TAAATTCTCC AGTAGACAAT GCTGGGTAAG GGAGGGGGTA GGGCTGGGTT ATTAAGATAC 540 

AGGCTGCTGT ATTITACATT GGTTGTGGGG GAAGGGGAGC CTGGAGAAAA CAAAGTCACT 600 

20 ATTCCXriTTT TTGAAACAGG AAAAAAAATT ATTTTTTGTT CAGTAAAAAT GGTAGAGAAT 660 

TCCAATGTCC CTAGCCACAA GGGACCAGTT CCACTGAGAA GTGAACAGTG GGAACTCAAA 720 

ATTTCAGAAA CATTGGGGGA AGGGAAAATT GGCTTTCTCT TAATTGGCAG ATGTTCCAGT 780 

25 

GGGGSGGGGG GGCTCTGTTT TTGTTGGGAT GTGTTATGTT GTATGTACGC ATATATGGAC 840 

CGGAGTCTGC TGAGTTTATA AQGTTCCAAA AATATGGTAA AATCTTGGTT TTTGTTAATT 900 

30 TATCTCAATA AAAGCCCACT GGRACTCCAA AAAAAAAAAA AAAAAAAAGA NN 952 

35 (2) INFOE»1ATIC»l FOR SEQ ID NO: 118: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 bcise pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

45 GACGTCATAG GTAAACAGGC TCTGTATCCG TGGCAGCGGC CGTGGCAGGC TGGCTGGGTA 60 

CCGGCTGTCG CTGACCCAQG AGAAGCTGCC TGTCTACATC AGCCTGGGCT GCAGCGCGCT 120 

GCCGCCGCGG GGCCGGCAGC TGAACTATGT GCTCTTCAGG GCGGQCACCG TGTTGCATTC 180 

50 

ATCTTTGTAC CCCCAGCATC TAGCAGTGTT GGCATGTAGT AGGCACTCAA GAAATGTGTG 240 

TTGAATGAAC GATGCCTGTG ACAAGCAAGC GGACTTTATT CTTTCCTGAC CCTTGCTCCT 300 

55 ATGACACACC TCCTCCTGAC TGCCACTGTC ACTCCTTCAG AGCAGAACTC CTCTAGGGAA 360 

CCTGGATGGG AAACAGCCAT GGCCAAGGAC ATCCTGGGTG AAGCAGGGCT ACACTTTGAT 420 

GAACTGAACA AGCTGAGGGT GTTGGACCCA GAGGTTACCC AGCAGACCAT AGAGCTGAAG 480 

60 
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GAAGAGTGCA 
ATTGAGCTTG 
GCTCGGAACT 
GCCCTAATAG 
TGTAAAGTAG 
GAAAATTTCG 
ACCTCTGTAG 
GAATGCAGCC 
ACAGCTGCTG 
AGCAGnTTGT 
AATAAATTTT 
TTTAACTCCT 
25 CAGTGGTCAG 



10 



15 



20 



AAGACTTTGT GGACAAAATT GGCCAGTTTC 
TTGATCAACT TGCAAAAGAA GCAGAAAATG 
TGCTCAAATC TATAGCAAAG CAGAGAGAAG 
CAGAAAAGAA AATGCAGCTA GAAAGGTATC 
AAGCAGAACA AAATGAATTT ATTGACCAAT 
CTTTTATAGT AGGAAGGCAA AACAAAAAAA 
CATTCCAGCX; GCTTGACCAA TGACCTATGT 
CCCTGAAGAC AGCACTACAA GTCTGGQGGA 
CTGGTGGCCC TGCAGTGTAC GTTCTCACCT 
AAACTTTCAT CCTTTTTTTT GTAAATTCAC 
TGTTTTCAAA TGGCTTGATG TACCTTTTTT 
CATGAGAGAA CCCTGGATTC TCTATCCCCT 
CAGCTACCTT TNATTTGGAT CACACACX5TG 



AGAAAATAGT TGGTGGTTTA 540 

AAAAGATCAA GGCCATCQGT 600 

CTCAACAGCA GCAACTTCAA 660 

GGGTTGAATA TGAAGCTTTG 720 

TTATTTTTCA GAAATGAACT 780 

AGCCTCTCAA AACCAAAAAA 840 

CACAAGAGGT GGCGTGTAAG 900 

GCCAGTTTTA ACATCAGTGC 960 

CTTATGCTTA GTTGGAACTA 1020 

AAT^GCTTTGG AAGGAGAAGC 1080 

CCTGTTGCTC TTGAAATATG 1140 

AGTCCACAAA ACAAACCAGG 1200 

T^TCAGACAG TACCAC 1256 



30 (2) INFORMATION FOR SEQ ID NO: 119: 



35 



40 



45 



50 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

GGCCGTAGCA GCCGGGCTGG TCCTGCTGCG AGCCGGCGGC CCGGAGTGGG GCGGCGGCAT 60 

GTACCTTCCA CATTGAGTAT TCAGAAAGAA GTGATCTGAA CTCTGACCAT TCTTTATGGA 120 

TACATTAAGT CAAATATAAG AGTCTGACTA CTTGACACAC TQGCTCGAGC AAACATGAAC 180 

GTTGGAGTTG CCCACAGTGA AGTGAATCCA AATACCCGTG TCATGAACAG CCGGGGTATG 240 

TGGCTGACAT ATGCATTGGG AGTTGGCTTG CTTCATATTG TCTTACTCAG CATTCCCTTC 300 

TTCAGTGTTC CTGTTGCTTG GACTTTAACA AATATTATAC ATAATCTGGG GATGTACGTA 360 

TTTTTGCATG CAGTGAAAGG AACACCTTTC GAAACTCCTG ACCAGGGTAA AGCAAGGCTC 420 

CTAACTCATT GGGAACAACT GGACTATGGA GTACAGTTTA CATCTTCACG GAAGTTTTTC 480 

ACAATTTCTC CAATAATTCT ATATTTTCTG GCAAGTTTCT ATACGAAGTA TGATCCAACT 540 

CACTTCATCC TAAACACAGC TTCTCTCCTG AGTGTACTAA TTCCCAAAAT GCCACAACTA 600 

CATGGTGTTC GGATCTTTGG AATTAATAAG TATTGAAATG TTTTGAAACT GAAAAAAAAT 660 
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20 



TTTACAGCTA CTGAATTTCT TATAAGGAAG GAGTQGTTAG TAAACTGCAC TGTTTCTSTG 720 

ATAATGTGAA ATGAGAACTA TTTACATTGG AGGGCCAATG GCTGGTCCTT CAAGTGCTGT 780 

TTTGAAGTGC AGATTTCCAT TAAATGATGC CTCTCTTTAA TACACCTGGT ACATTTCTGA 840 

AGAGGGGCTT TATAAGCAGG CTGGGCAGGC CCAGCTTATA AGTTAAAGGG CATCACAGTG 900 

AGGGTGTAGT AGATAAATTC AAGGAAATAA GAGAnTGTA AGAAACTAGG ACCAGCTTAA 960 

CTTATAATGA ATGGGCATTG TGTTAAGAAA AGAACATTTC CAGTCATTCA GCTGTGGTTA 1020 

TTTAAAGCAG ACTTACATGT AAACCGGAAT CCTCTCTATA CAAGTTTATT AAAGATTATT 1080 

TTTATTACCG TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAANA 1140 

GAN 1143 
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(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CAGGCCCCGG CCCCCCACCC ACGTCTGCGT TGCTGCCCCG CCTGGGCCRG GCCCCAAAGG 60 

CAAGGACAAA GCAGCTGTCA GGGAACCTCC GCCGGAGTCG AATTTACGTG C7«3CTGCCGG 120 

CAACCACA<S3" TTCCAAGATC CriTTGCQOGG GCTOX3CCTG TTCC5^GAAC"TCCCT 180 

CCCTCAACCT GCTTTACACC TTGGTTAGTC TGCTGCTAAT TGGAATTGCT GCGTGGGGCA 240 

TTGGCTTCGG GCTGATTTCC AGTCTCCGAG TGGTCGGCGT GGTCATTGCA GTGGGCATCT 300 

TCTTGTTCCT GATTGCTTTA GTGGGTCTGA TTGGAGCTGT AAAACATCAT CAGGTGTTGC 360 

TATTYTTTTA TATGATTATT CTGTTACTTG TATTTATTGT TCAGTTTTCT GTATCTTGCG 420 

CT TGTTTAGC CCTGAACCAG GAGCAACAGG GTCAGCTTCT GGAGGTTGGT TGGAACAATA 480 

CGGCAAGTGC TCGAAATGAC ATCCAGAGAA ATCTAAACTG CTGTGGGTTC CGAAGTGTTA 540 

ACCCAAATGA CACCTGTCTG GCTAGCTGTG TTAAAAGTGA CCACTCGTGC TCGCCATGTG 600 

CTCCAATCAT AGGAGAATAT GCTGGAGAGG TTTTGAGATT TGTTGGTGGC ATTGGCCTGT 660 

TCTTCAGTTT TACAGAGATC CTGGGTGTTT GGCTGACCTA CAGATACAGG AACCAGAAAG 720 

ACCCCCGCGC RAATCCTAGT GCATTCCTTT GATGAGAAAA CAAGGAAGAT TTCCTTTCGT 780 

ATTATGATCT TGTTCACTTT CTGTAATTTT CTGTTAAGCT CCATTTGCCA GTTTAAGGAA 840 
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GGAAACACTA TCTGGAAAAG TACCTTATTG ATAGTGGAAT TATATATTTT TACTCTATGT 900 

TTCTCTACAT GTTTTTTTCT TTCCGTTGCT GAAAAATATT TGAAACTTGT GC3TCTCTGAA 960 

5 GCTCQGTGGC AOCTGGGAAT TTACTGTATT CATTGTCGGG CACTGTCCAC TGTGGCCTTT 1020 

CTT7W3CATTT TTACCTGCAG AAAAACTTTG TATGGTACCA CTGTCTTGGT TATATGGTGA 1080 

ATCTGAACGT ACATCTCACT GGTATAATTA TATGTAGCAC TGTGCTGTGT AGATAGTTCC 1140 

10 

TACTGGAAAA AGAGTGGRAA TTTATrAAAA TCAGAAAGTA TGAGATCCTG TTATGTTAAG 1200 

GGAAATCCAA ATTCCCAATT TTTTTTGGTC TTTTTAGGAA AGATGTOTTG TGGTAAAAAC3 1260 

15 TCTTAGTATA AAAATGATAA TTWACTKGTA GTCTTTTATG ATWACACCAA TCTATTCTAG 1320 

AAATAGTTAT GYCVTAGGAA ATTGTGGTTT AATTTTTGAC TTTTACAGGT AAGTGCAAAG 1380 

GAGAAGTGGT TTCATGAAAT GTTCTAATGT ATAATAACAT TTACCTTCAG CCTCCATCAG 1440 

20 

AATGGAACGA GTTTTGAGTA ATCAGGAAQT ATATCTATAT GATCTTGATA TTGlTrrATA 1500 

ATAATTTGAA GTCTAAAAGA CTGCATTTTT AAACAAGTTA GTATTAATGC GTTGGCXX:AC 1560 

25 GTAGCAAAAA GATATTTGAT TATCTTAAAA ATTGTTAAAT ACCGTTTTCA TGAAAGTTCT 1620 

CAGTATTGTA ACAGCAACTT GTYAAACCTA AGCATATTTG AATATGATCT CCCATAATTT 1680 

GAAATTGAAA TCGTATTGTG TOGCTCTGTA TATTCTGTTA AAAAATTAAA GGACAGAAAC 1740 

30 

CTTTCTTTGT GTATGCATGT TTGAATTAAA AGAAAC3TAAT GG 1782 



35 

(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 610 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

45 

GnTGGCTGCA GATTTC5TGGT GOGTTCTGAG CCGTCTGTCC TGCGCCAAGA TGCTTCAAAG 60 

TATTATTAAA AACATATGGA TCCCCATGAA GCCCTACTAC ACCAAACyiTT ACCAGGAGAT 120 

50 TTGGATAGGA ATGGGGCTGA TGGGCTTCAT CCTTTATAAA ATCCGGGCTG CTGATAAAAG 180 

AAGTAAGGCT TTGAAAGCTT CAGCGCCTGC TCCTGGTCAT CACAACCAGA TTTACTTGGA 240 

GTACATGTGA AAGAAAACGT CAGTCTGCCT GTAAATTTCA GCAAGCCGTG TTAGATGGGG 300 

55 

AGCGTGGAAC GTCACTGTAC ACTTGTATAA GTACCGTTTA CTTCATGGCA TGAATAAATG 360 

GATCTGTGAG ATGCACTGCT ACCTGGTACT GCTTTCAGTG TGTrCCCCCT CAGCCCTCCG 420 

60 GCGTGTCAGG CATACTCTGA GTAGATAATT TGTCATGCAG CGCATGCAAT CAGAATCTCA 480 
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CTGAGCCACC CATCATTGTG AAATAATTAC CTCAGTTGTA CAGGACTTGG TGATCAQGAT 540 

CCAGGCACTC ACTTGTATTC TACTGCTCAA TAAACGTTTA TTAAACTTGA AAAAAAAAAA 600 

AAAAAAAAAA 610 

(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) L0{GTH: 526 base pairs 
15 (B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEC^ffiWCE DESCRIPTION: SEQ ID NO: 122: 

GGTACGCCTG CAGGTACCGG TCCGGAATTC CGGGTCGCCC ACGCGTCNGG CCACGCGTCC 60 

ACCCACGCGT CCGSCCACGC GTCGGAQCCG AGCCGGACTG GTCAGGATGA TCACGGACGT 120 

25 GCAGCTCGCC ATCTTCGCCA ACATGCTGGG CGTGTCGCTC TTCTTGCTTG TCGTTCTCTA 180 

TCACTACGTG GCCGTCAACA ATCCCAAGAA GCAGGAATGA AAGTGGCGCT TTCTCCGCCC 240 

CAGGGTTCCA GGACATAGTC TGAGGCAAGA TGGAGGGTAT GAGGGGCCTT CACACTTCAC 300 

30 

TTCATCCCTT CTACCCATCA CAACATACAA AGCAACTACA CCTGGATTTT TCCAAACAAC 360 

TTTTATTTCC TCAGAGTCTT CCTTAATCCT ATGGAACAAG AAGCTGCCAC TGAATAGGGC 420 

35 CCAGTATAGG GGCTTGCTTT TCTACTCCCT CCCCCCAATA TAAAAATATA GACTTTTTAA 480 

- - AAAAAAAAAA AAAAANTTGG NGQGGGGSCC GGTACeGATC CeCCTA 526 



(2) INFORMATION FOR SEQ ID NO: 123: 



(i) SEQUENCE CHARACTERISTICS: 
45 <A) LENGTH: 2081 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

TGTACCGGTC CGGAAATTCC CGGGTCGACC CACGTCGTCS GGGGAACATG GCGGCTKCGG 60 

AGCCGGCGGT CCTTGCGCTC CCCAACAGCG GCGCCGQGGG CGCGGGGGCG CCGTCGGGCA 120 

55 

CAGTCCCGGT GCTCTTCTGT TTCTCAGTCT TCGCGCGACC CTCGTCGGTG CCACACGGGG 180 

CGGGCTACGA GCTGCTCATC CAGAAGTTCC TCAGCCTGTA CGGCGACCAG ATCGACATGC 240 

60 ACCGCAAATT CGTGGTGCAG CTGTTCGCCG AGGAGTGGGG CCAGTACGTG GACTTGCCCA 300 
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AGGGCTTCGC GGTRAGCGAG CGCTGCAAGG TGCXXXTTCGT GCCGYTGCAG ATCCAGCTCA 360 

CTACCCTGGG AAATCTTACA CCTTCAAGCA CTGTCTTTTT CTGCTCTGAT ATGCAGGAAA 420 

5 

GGTTCAGACC AGCCATCAAG TATTTTGGGG ATATTATTAG CGTGGGACAG AGATTCTTGC 480 

AAC3GGGCCCG GATTTTAGGA ATTCCTGTTA TTGTAACAGA ACAATACCCT AAAGGTCTTG 540 

10 GGAGCACGGT TCAAGAAATT GATTTAACAG GTGTAAAACT GGTACTTCCA AAGACCAAGT 600 

TTTCAATGGT ATTACCAGAA GTAGAAGCX3G CATTAGCAGA GATTCCCGGA GTCAGGAGTG 660 

TTGTATTATT TGGAGTAGAA ACTCATGTGT GCATXXAACA AACTCCXXTTG GAGCTAGTTG 720 

15 

GCCGAGGAGT CGAGGTTCAC ATTGTTGCTG ATGCXZACCTC ATCAAGAAGC ATGATGGACA 780 

GGATGTTTGC CCTCGAGCGT CTCGCTCRAR CCGGGATCAT AGTGACCACG AGTGAGGCTG 840 

20 TTCTGCTTCA GCTGGTAGCT GATAAGGACC ATCCAAAATT CAAGGAAATT CAGAATCTAA 900 

TTAAGGCGAG TGCTCCAGAG TCGGGTCTGC TTTCCAAAGT ATAGGACATT TGAAGAACTC 960 

GTATGCTACT CACTGGTGAA GGACAGTCAG GTGAAGGACT GTAAGCCCAC ACAAGCTCTT 1020 

25 

CTTATCTCTA CTAGAATTAA AATGTTAAGT CAAAAACGGC TCCTTTTTTG CX5CCTCCTAG 1080 

TGAAACTTAA CXIAGCTAGAC CATTTGAGTA CX:AGCATTTA GTTACAAACG TCAAAGGCTT 1140 

30 CCGGTGCTGC TTACCTTCCT TTTTTGTTAA TGTGCTTTTA TTTATTAAAA AAAATTACAA 1200 

TGAAGATGCC TGTTTTGTCT CTACTGTGTA CTCTGATCGT ATCTTTCCAA AGTGCAGACT 1260 

CTTGTGAAGT TTTCTTAAAT TGTTCACTTT AAAGAAAATG ACGTACCAAC AATGATTTGG 1320 

35 

CTTTTATATT ACTGTAAGAT GTTATAATGT TAATGTQGAT GTAGTGCTTT TACTTTACAG 1380 

ATTGATTGGA ATAAGATTAT TGCATATGAA TTTACCCACA GGACTCTGAA TCATGTTACC 1440 

40 CACTCCCCTC ACAATGTTGT CCACTTAGTG AGTTGCATTG ATCTATCCGT ACCAAATGAT 1500 

GTTGAATAAT TACATATCTT TCTTGACTAT ACTGATTTCT TATTTTGGTC ACTATTACTA 1560 

AATCTCTGTT AATATTCTCT CmTAACTG AAAAGGGATG GGATAGAAGG GTTTGCAATG 1620 

45 

CCATATTATT GGTGGAGGGC TGTTTTAACA TCTTTGAAGT ATGGCTTGCT GAATATCTTT 1680 

ACCAACATCT TGAATATATA TTCTAGTGTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG 1740 

50 GTGGAATATC ATTTTAAAAT GTTCATGTTC TGTTCTATAT TTTCTTCACC TACTCTCCAA 1800 

ATATTGTAAT GCAAAAAGTC TCAGTAATGA TTTQGTAGTA TTAATTTTGT GGTCATTGTT 1860 

TCTCTTCGAT AAATTTATTT TCATTAAATA CTTRTTAGAG GGTITTGAAA TGTTmCAA 1920 

55 

ATATGTGAAA TGTGAAACTG CTGTCTTTTA TATTAAAGTA ATTAAAGAAA ATGTATTGTG 1980 

ATTGAAATTA TTTTGNCXrrC CACAAGATGG CTCTATGAGT ATTCTTCCAG GGATTCTAAT 2040 

60 ATTTATTTAA GGTTNATAAAA TCTTOACATT TATAATCTTT C 2081 
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(2) INFORMATION FOR SBQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEENESS : doiible 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 124: 

CCCCGGCGGA GCTQGACCCG CGGTOGGCTA GGGGCAGGGC CGGAGCCGCG GCGGCGGAGC 

TGTGGATCCT TCATGATGAG AGATTTGGGG ACACTTCTCT CTCCTGTGTC TAGTTGATAG 

TTTGGTGGTG AAGAGATGGC TGACAGTGTC AAAACCTTTC TCCAGGACCT TGCCAGAGGA 

ATCAAAGACT CCATCTGGGG TATTTGTACC ATCTCAAAGC TAGATGCTCG AATCCAGCAA 

AAGAGAGAGG AGCAGCGTCG AAGAAGGGCA AGTAGTGTCT TGGCACAGAG AAGAGCCCAG 

AGTATAGAGC GGAAGCAAGA GAGTGAGCCA CGTA1TGTTA GTAGAATTTT CCAGTGTTGT 

GCTTGGAATG GTGGAGTGTT CTGGTTCAGT CTCCTCTTGT TTTATCGAC3T ATTTATTCCT 

GTGCTTCAGT CGGTAACAGC CCGAATTATC GGTGACCCAT CACTACATGG AGATGTTTGG 

TCGTGGCTGG AATTCTTCCT CACGTCAATT TTCAGTQCTC TTTGGGTGCT CCCCTTGTTT 

GTGCTTAGCA AAGTGGTGAA TGCCATTTGG TTTCAGGATA TAGCTGACCT GGCATTTGAG 

GTATCAQGGA GGAAGCCTCA CCCATTCCCT AGTGTCAGCA AAATAATTGC TGACATGCTC 



TTCAACCTTT TGCTGCAGGC TCTTTTCCTC ATTCAGGGAA TGTTTGTGAG TCTCTTTCCC 
ATCCATCTTG TCGGTCAGCT GGTTAGTCTC CTGCATATGT CCCTTCTCTA CTCACTGTAC 
TGCTTTGAAT ATCGTTGGTT CAATAAAGGA ATTGAAATGC ACCAGCGGTT GTCTAACATA 
GAAAGGAATT GGCCTTACTA CTTTGGGTTT GGTTTGCCCT TGGCTTTTCT CACAGCAATG 
CAGTCCTCAT ATATTATCAG TGGCTGCCTT TTCTCTATCC TCTTTCCTTT ATTCATTATC 
AGCGCCAATG AAGCAAAGAC CCCTGGCAAA GCRTATCTCT TCCACTTGCG CCTCTTCTCC 
TTGGTGGTCT TCTTAAGCAA CAGACTCTTC CACAAGACAG TCTACCTGCA GTCGGCCCTG 
AGCAGCTCTA CTTCTGCAGA GAAGTTCCCT TCACCGCATC CGTCGCCTGC CAAACTGAAG 
GCTACTGCAG GTCACTGAGT TGCCTGCCAT CCAAAGGGGA TGGGCGGGAT TGGAAGAAGC 
TGTGGCAGCT CnTTCCCTG TTCACCTCCC GCCTGCCAGG GAAGGCAGGA CCCGCTCTGC 
CAAGGGCCCT CTGCGTATTC CCTTCTCTCT GAGGAATTGA AATTTTTGTC TCTGGTGCAC 
GTAAGGCAGA ATGTTCCCTG ACACCAGTGT GTGGATTTTT AACATCACCG TGAGTCTGAA 



60 
120 
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240 
300 
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420 
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540 
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960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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AGGACCACAG GTnTTCTGC AGCTATTTTC TAGCATTTGC CAGTCCCTGT GCCTGGACTG 1440 

ATTGGAACAC TTTGTTTTTC TCCCTC5TGCC ATTTACCCTT CCACXnTTCC ATCCTCCCTT 1500 

CTACCACCCT TGGATGAATG GATriTGTAA TTCTAGCTGT TGTATTTTGT GAATTTGTTA 1560 

AmTGTTGT TTTTCTGTGA AACACATACA TTGGATATGG GAGC3TAAAGG AGTGTCCCAG 1620 

TTGCTCCTGG TCACTCCCTT TATAGCCATT ACTGTCTTGT TTCTTGTAAC TCAGGTTAGG 1680 

TCTTGCTCCA CTGCAAAAAA AAAAAAA 1717 
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(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQ3GTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
CCACGCGTCC GGTCACTATG TAGTGGAGGG GCAGACACCC TCCCGCAAAT TCTGGAAGGT 
TCTTAGTCTC GACTAGGGCA GTAGCCCCAG GACTCCTAGT CGCCGGCTTC AGGTCACTGC 
CGGCTGAACG GAGCTGCCGT CGCCATGTTT GGCTGCTTGG TGGCGGGGAG GCTQGTGCAA 
ACAGCTGCAC AGCAAGTGGC AGAGGATAAA TTTGTTTTTG ACTTACCTGA TTATGAAAGT 
ATCAACCATG TTGTGGTTTT TATGCTGGGA ACAATCCCAT TTCCTGAGGG AATGGGAGGA 
TCTGTCTACT TTTCTTATCC TGATTCAAAT GGAATGCCAG TATGGCAACT CCTAGGATTT 



GTCACGAATG GGAAGCCAAG TGCCATCTTC AAAATTTCAG GTCTTAAATC TGGAGAAGGA 
AGCCAACATC CTTTTGGAGC CATGAATATT GTCCGAACTC CATCTGTTGC TCAGATTGGA 
ATTTCAGTGG AATTATTAGA CAGTATGGCT CAGCAGACTC CTGTAGGTAA TGCTGCTGTA 
TCCTCAGTTG ACTCATTCAC TCAGTTCACA CAAAAGATGT TGGACAATTT CTACAATTTT 
GCTTCATCAT TTGCTGTCTC TCAGGCCCAG ATGACACCAA GOCCATCTGA AATGTTCATT 
CCGGCAAATG TGGTTCTGAA ATGGTATGAA AACTTTCAAA GACGACTAGC ACAGAACCCT 
NTNTTTTGGN AAACATAATT TGAATAAAAT AATTTTTAAT GGATTNTGNA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAA 



60 
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(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEI]»iESS : doiible 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 126: 

GGCACAGCCC AGGGCCTTGA AGCCAGCTGG CCCTGGAGAG GGGCTGCTGT GCCAGCTTGG 60 

GGAGGGTCTG CX3ATGGGGCT GCCCCTGATG GCCCTGATGT GGAGrTACCTT GCCAGCATCT 120 

GCTGGGGTCA ACTTTATTTT AGCCCTTCCC TTGTTGCTCT TATGGAAGAA CAGAGGAGGG 180 

GTGGGCAGGT CAGTGATGTC AGCAGTGGAG TGATTCCCAG CACAGCGGCT TCTGGGAAGA 240 

15 GGGCATGGAG GCATTTCTTT CAGGGAAATG GTCCATNATT TCAGCCAGAA GGCATTGCAT 300 

TAAC3TTAAGT CCNGGACTTT TGTGGCCCAG CTCTGTGTTA TTAAGGGCCC TTGGCGAAGA 360 

CTTCAAGGAG GGGGCAAAAN GACCTTTAAG TTTTTAGGTT TAACACAGGG AACCCNCAAA 420 

GGGTTATTTT G 431 
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(2) INFORMATION FOR SEQ ID NO: 127: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
NGGCACGAGG AGAGTCACCT GGACTCAGAA CTftGAGMAT CC?ATGACqC AGACAAAATT _ _ 6 0 

AAACTTCAGC TTTCTAAGCA TAAGGAGTTT CAGAAGACTC TTGGTGGCAA GCAGCCTGTG 120 

40 TATQATACCA CAATTAGAAC TGGCAGAGCA CTGAAAGAAA AGACTTTGCT TCCCGAAGAT 180 

ASTCAGAAAC TTGACAATTT CCTAGGAGAA GTCAGAGACA AATGGGATAC TGTTTGTGGC 240 

AAGTCTGTGG AGCGGCAGCA CAAGTTGGAG GAAGCCCTGC TCTTTTCGGG TCAGTTCATG 300 

45 

GATGCTTTGC AGGCATTGGT TGACTGGTTA TACAAGGTGG AGCCACAGCT GGCTGAGGAC 360 

CAGCCCGTGC ACGGGGGACC TTGACCTCGT CATGAACCTC ATGGATGCAC ACAAGGTTTT 420 

50 CCAGAAGGAA CTGGNGAAAG CGAACAGGAA CCGTTCAGGT CCTGAAGCGG TCAGGCOGAG 480 

AGCTGATTGA GAATAGTCGA GATGACACCA CTTGQGTAAA AGGACAGCTC CAGGAACTGA 540 

GCACTCGCTG GGACACTGTC TGTAAACTCT CTGTrTCCAA ACAAAGCCGG CTTGAGCAGG 600 

55 

CCTTAAAACA AGCGGAAGTG TTTCGAGACA CAGTCCACAT GCTGTTGGAG TOGCTTTCTG 660 

AAGCAGAGCA AACGCTTCGC TTTCGGGGAG CACTTCCTGG ATGACACAGA GGCCCTGCAG 720 

60 TCTCTCATTG ACACCCATAA GGAATTCATG AAGAAAGTAG AAGAAAAGCG AGTGGACGTT 780 
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AACTCAGCAG TAGCCATGGG AGAAGTCATC CTGGCTGTCT GCCACCCCGA TTGCATCACA 840 

^ ACCATCAAAC ACTGGATCAC CATCATCCGA GCTaXTTTOG AGGAGGTCCT GACATQQC5CT 900 

AAGCAGCACC AGCAGCGTCT TGAAACGGCC TTGTCAGAAC TGGTGGCTAA OXSCTGAGCTC 960 

CTCGAAGAAC TTCTGGCATG GATCCAGTGG GCTGAGACCA CCCTCATTCA GCGGGATCAG 1020 

10 GAGCCAATCC CGCAGAACAT TGACCX5AGTT AAAGCCCTTA TCGCTCAGCA TCAGACATTT 1080 

ATGGAGGAGA TGACTCGCAA ACAGCCTGAC GTGGACCGGG TCACCAAGAC ATACAAAAfSG 1140 

AAAAACATAG AGCCTACTCA CGCGCCTTTC ATAGAGAAAT CCCGCAGCGG AGGCAGGAAA 1200 

15 

TCCCTAAGTC AGCCAACCCC TCCTCCCATG CCAATCCTTT CACAGTCTGA AGCAAAAAAC 1260 

CCACGGATCA ACXAGCTTTC TGCCXX3CTGG CAGCAGGTGT GGCTGTTAGC ACTGGAOCGG 1320 

20 CAAAGGAAAC TGAATCATGC CTTGGATCGG CTGGAGGAGT TGAAAGAATT TGCCAACTTT 1380 

GACTTTGATG TCTGGAGGAA AAAGTATATG CXTTTGCATGA ATCACAAAAA GTCTCGAGTG 1440 

ATGGATTTCT TCCGGCGCAT TGATAAGGAC CAGGATGGGA AGATAACACG ICAGGAGTTT 1500 

25 

ATCGATGGCA TTTTAGCATC CAAGTTCCCC ACCACCAAGT TAGAGATGAC TGCTOTGGCT 1560 

GACATTTTCG ACCGAGATGG GGATGGTTAC ATTGATTATT ATGAATTTOT GGCTGCTCTT 1620 

30 CATCCCAACA AGGATGCGTA TCGACCAACA ACXXSATGCAG ATAAAATCGA AGATGAGGTT 1680 

ACAAGACAAG TGGCTCAGTG CAAATGTGCA AAAAGCTTTC AGGTGGAGCA GATCGGAGAG 1740 

AATAAATACC GGTTCTTCCT CGGCAATCAG TTTGGGGATT CTCAGCAGTT GCGGCTGGTC 1800 

35 

CGTATTCTGC GCAACCGTGA TGGTTCGCGT TGGTGGAGGA TGGATGGCCT TGGATGAATT 1860 

TTTAGTCAAA AATGATCCCT GCCGAGCACG AGGTAGAACT AACATTGAAC TTAGAGAGAA 1920 

40 ATTCATCCTA CCAGAGGGAG CATCCCAGGG AATGACCCXX: TTCCGCTCAC GGGGTCGAAG 1980 

GTCCAAACCA TCTTCCCX^GG CAGCTTCCCC TACTCGTTCC AGCTCCAGTG CTAGTCAGAG 2040 

TAACCACAGC TGTACATCCA TGCCATCTTC TCCAGCCACC CCAGCCAGTG GAACCAAQGT 2100 

45 

TATCCXATCA TCAGGTAGCA AGTTGAAACG ACCAACACCA ACmTCATT CTAGTCGGAC 2160 

ATCCCTTGCT GGTGATACCA GCAATNAGTT CTTCCCCGGC CTCCACAGGT GCCAAAACTA 2220 

50 ATCGGGCAGA CCCTAAAAAG TCTGCCAGTC GCCCTGGGAG TCGGGCTGGG AGTCGAGCCG 2280 

GGAGTCGAGC CAGCAGCCGG CGAGGAACTG ACGCTTCTGA CTTTGACCTC TTAGAGACGC 2340 

ATTGCTTGTT CCGACACTTC AGAAAGCAGC QCTGCAGGGG GCX:AAGGCAA CTCCAGGAGA 2400 

55 

GQGCTAAACA AACCTTCCAA AATCCCAACC ATGTCTAAGA AGACCACCAC TGCCTCCCCC 2460 

AGGACTCCAG GTCCCAAGCG ATAACACTGT CTAAGCACCC CCAAGCCACT ATCCACTTTG 2520 

60 AATCCTGCTC CATACATTGG GTGTATATTT ATTCTGAACG GGAGAAGTTA TATTGTTAAA 2580 



tSDCX^tO: <W0_983344aAa.l_> 



wo 98/39448 



PCT/US98/04493 



350 



AGTGTAAAAG AATAATTGTG TTATGAAGCT GCCTTATTTT OTl ' lCrmT GTAAGTTACT 2640 

ATTTTCATGT GAATATTTAT GTAGATAAAA TTTGCCTCCT QGTAACCCTG TAATGGATGG 2700 

5 

GGCCCAGAAA TGAAATATTT GAGAAAAACA AGTGAAAAGG TCAAGATACA AATGTC3TATT 2760 

AAAAAAAAAA AAGCCTATTA ATAGGGTTTC TGCGCGGTGC AGGGTTOTAA ACCTGCTTTA 2820 

10 TCTTTTAGGA TTATTCCTAA ATGCATCTTC OTTATAAACT TGACTTGCTA TCTCAGCAAG 2880 

ATAAATTATA TTAAAAAAAT AAGAATCCTG CAGTGTTTAA GGAACTCTTT TTTTGTAAAT 2940 

CACGGACACC TCAATTAGCA AGAACTGAGG GGAGGGCTTT TTCCATTGTT T7ATGTTTTG 3000 

15 

TGATTTTTAG CTAAAGAGAG GGAACCTCAT CTAAGTAACA TTTGCACATG ATACAGCAAA 3060 

AGGAGnrCAT TGCAATACTG TCTTTGGATA TTGTTTCAGT ACTGGGTGTT TAAAGGACAA 3120 

20 ATAGCTGCTA GAATTCAGGG GTAAATGTAA GTGTTCAGAA AACGTCAGAA CATTTGGGGT 3180 

TTTAAACTGA TTTGrTTQCTC CCTATCCAGC CTAGACACCA GTAACTCTTG TGTTCACCAG 3240 

GACXrCAGACC CTTGGCAAGG GATAGGCTCG TTGGTGACAT TGTGAATTTC AGATTTGTTT 3300 

25 

TATCCACTTT TTTTGCTATT TATTTAAATG GTCGATCAAC TTCCCACAAA CTGAGGAATG 3360 

AATTCCACGA GCCTGTTCTG AAAATGTQGA CX5TAAGACAA ACACGTGCTC GTCCTTTAAT 3420 

30 GGAGTTCACC AGCACACTTG TTAACCAGTC CTCTTTGCTT TCGTCTTTTT TTGTGCGTAA 3480 

TAAAGTCAAC TGACCAAGTG ACCATGAAAA GGGGCTGTCT GGGGCTCCTG TTTTTTAGCT 3540 

GCTGTTCTTC AGCTCCGACC ATGTTGCTGT GTGATTATCT CAATTGGTTT TAATTGAGGC 3600 

35 

AGAAACTGAA GCTCTACCAA TGAACTGTTT AGAAACAAGA CACACTTTTG TATTAAAATT 3660 



GCTTGCAGTA ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGAGG GGGGCCOGGT 3720 
40 ACCCAATTCG CCGTATATGA TCGTAAACAA TC 3752 



45 (2) INFORMATIC»I FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS; 

(A>. LENGTH: 1144 base pairs 
(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

55 TGACCCTCTG CCTGCCGGGC TCAGTGCTGG ACGCTTTCTG TTTTGTCGCA GTCGGTCCTC 60 

GGTAACACCA GCGGCCTGTG GTCCACCACT CCAITCAGCA GCTCCATTTG GTCCAGCAAC 120 

CTTAGCAGCG CCTTCCCTTC ACCACTCCAG CAAACAOGCT GGCAAGCATC GQCCTCATGG 180 

60 
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GCACAGAAAA 
GACAGACCTA 
CTTGGTCTAA 
GTGGGCCCTC 
TGCAGCTCCC 
GACCACTCGC 
CTTTTAGAGA 
CTTTCCCGTT 
ACAGATTTGA 
ACAATTTTGr 
TCTAATTTTA 
GCTGTCTTGG 
TGTAAAAACA 
AACATTTGTA 
AGATACAAGA 
ATGATGCTTT 
CAAT 



CTCCCCTGCT 
CAACCXGTQG 
TTCGCACTTT 
GTCTAGATCA 
CTCCXXACCC 
TTCTTTCAGA 
AAAGTCTGGA 
ACTGCGTTTT 
CATTACAGCT 
TAAATTACCA 
AAACTGTATG 
TTTAAAAGCA 
TTCAGTYGAG 
TACCTGAACT 
TTGAATGTGT 
TTTCAAAAAA 



351 



CXTCACGCTC CXTTCCACCTC CAGTCCAGCT GACGACTTGG 240 

CGGATATGGA GCCCCACGAT TCGAAGAAGA AGCTCGGACX: 300 

CCTCACGAGA ATTAAATTAA GCAAAAAACA AACAAACATA 360 

TGATGTGCCA GTTTCTGAGA CATCTTTTTA AGGCTCTTAC 420 

TCCTCTTCTT TGCAAAACAG ACCCAAGCAG GGCAGGCTCA 480 

TCTTTCTTGC AATTATGATA ACATGAGATT TGCTGTTGTC 540 

CTCAGCCACA AACTCTAATA AGACCTGTAC ATCTGAGAAC 600 

CACCACCTGT CTTCCCCATG CTTTATTTAT CTGTATGAAC 660 

AAGGAAATAA TTTGAGTTGA TTCAGAAATC CTGGCATGTG 720 

AGTTTGGTTT TTAATAATTT CTCAATATTA TGCXXTCAAGA 780 

AQGACnTGT GCTGAAAATA GAGTATTTTT TTAAACTTAAG 840 

GATTACAGAA ATGTAAGTCA ACTTAAGAAC RGTGAATGAA 900 

ACCATATGCA TTTTCTGTGC TGTTTGTACT TGAGGTATGT 960 

TATTTTAAAG ATGAACTGAA ATGCACATAG CCAAGTCTTG 1020 

ATTTCTTAAA AATACAACTT TGTGTTGTAC TTTGAAATAA 1080 

AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCX3GTACC 1140 

1144 



40 



45 



50 



55 
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(2) INFORMATION FOR SEQ ID NO; 129: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) SnvaroEtSNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GCATGCAGAG GAGCACCCTG AGCGTGTYCC TGGAGCAGGC 
ACGGGTTGCT GCCCAAGTGC ATCATGCAGG CCACGGACAT 
GGTGGAGATT CTGQCCAAAA ACCTGCGAGT CAAGGACCAG 
CCTCTACCGC CTCTGCCAGC CGCCGGTGGA TGGGGACCTC 
GCTGGGCCGC GGCCTCTGGA GCTGGGATTT GGGAGGACAC 
TCCAGGGATG GCCCAANGCT TCCGCARCCG CCCGTTCCGG 
TGCCTCCTTC CGGGACAAGC CTGGCCACCC TCGCTGTGAT 



129: 

GGCCATSTTG GCACGGAGCC 60 

CATGCGGAAC AGGGCCCAAG 120 

ATGCCCCAGG GTGCTCCGCG 180 

TGAACACCCA AATGCCCCAC 240 

AGCAGGCAGC GCTGGCCTTC 300 

GACCTGCCCA GCGTCCTCCC 360 

GACGAGCTGG CTGATTGGCC 420 
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CTGGGCCGGC CCATTCrrCA CACGCCTGCC AGAAGCTGGA GGGGTGCTGG AGACCCATAG 480 

^ AGCTGATGGG AGCAGCTGGT GCCTGGCCTT CGGCTCCTGC GTCCCCAGAA CCCAAGGGAA 540 

CGTCATGGAG GCCACATGGG GCCACCCGGC TCCCTCGGGA TGGCTCCGCT GCACrrTTGA 600 

AACCCXX3GTT TCCTTCAACG TCCACATTCC AGGTGACCAC ACGTCTCTCC TCCTCCTCAT 660 

10 CTTAGCTTCC AGGTTCACCC TAACXrCTGTA CTAACCTGCT TGGTGGACTT QGAAAAGACT 720 

TGGCTCTCTC GGGAAAGGAG AGAOGGGGCC TCCATCACGC CTGTTACCAG AGGATCCCCG 780 

AGAGCCACAC CAGCTCTCGA CATCACCX3CC CCTGGAACTG GGGCCACCAG CCCTGGGCAC 840 

15 

GAGATTTGCT CTGACTTTAT TTATATGGCA TCAAATCTCT GGTTTATTTT GGGATTTTTT 900 

GTTGTTGGTG TTGTCAAAGT TTGTTTTTTC TAAAGTTGTG TGATTATATA TTTGACATTr 960 

20 TACATTTCAA AGAAAGGTAT GTTGTCTAAC AGGGGACCAA CAGAAGGTAG TATTGACAAC 1020 

TGTTCCTGCT TCTACTAAAA AAAAAAGAGC ACAAAAGAAA AACTAAATTA TTGAAAAATT 1080 

AAAAAATGTC ATTGTTTCCT GTTTGTTAAT ATTAGGGTTG TAAGGTGTCG TTTTGAGGTA 1140 

TCGACTGTGA TTCCTTCCCC CACCCTCCAT TCTCCAGCX5G TTGGCCGGTG TTAGAACTCG 1200 

CTCTCTTTGA GTGACTGGCT ACAAGGGCXTT GAGAGGTGGC CAGCCAGQGT TGGAGCTGGA 1260 

GGGGATGGAG CCCCACCTGA GGTGCCGTGT CACACQGGTT AGAGGGTCAC TGGGAAACAC 1320 

CXSGGCGGTGG CTTCTGTGAT TTATTTTCTT GATGGTAACT TCTCAGAGCA GGGCRATTGG 1380 

GACATCACCA GCCAGAGCAC AGGAAGCCAC CCTGCCTGCT GGGGAGGAGG GACCCACACA 1440 
AGCCCCCTCG ,G<^GTmGTC CCOXAGCTT CGGTAT GCCT TC AGGGAAAG CTCACAQCTG „ _ 1500 , 

GGGAGGAAGC GGGGGGACGC CTGTCACCCC TGGCAGGTGG TGAGTTCAGG TGGGGGCTCC 1560 

CTGCTKCCCC CAGGCCTGGG AGCTTGAAGC CCTCCCGGCA TCTGGCATCC GAGCCTCCCX3 1620 

CCCTCCAGGG TGCGCTTCCC TCTCTTGCCG CAGCATACAC GAGGGCAGGC AGTGGCCTTG 1680 

TCACTGTATC TTGCATCAGA GACAAAGGAG GACCCGCTTT AGCCCTGCTG CGGGAAATGG 1740 

GGGATGGCCC AGGGCCAGCX5 CATTGTGCAC TGGTTTACTT TAAAATGTAC AGATTCTTCT 1800 

CGTTAAATTC TTGATAGATT TTTTATTATT 1830 



25 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEO^SS: double 

(D) TOPOIiOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

GGCCX3CCCGG ATGGCGACCX: CAGCCTCGGC CCCAGACACA CGGGCTCTQG TGGCAGACTT 60 

TGTAGGTTAT AAGCTGAGGC AGAAGGGTTA TGTCTGTGGA GCTGGCCCCG OGGAGGGCCX: 120 

AGCAGCTGAC CCGCTGCACC AAGCCATGCG GGCAGCKGGA GATGAC5TTCG AGACCCGCTT 180 

CCGGCGCACC TTCTCTGATC TGGCGGCTCA GCTGCATCTG ACCCCAGGCT CAGCCCAACA 240 

ACGCTTCACC CAGGTCTCCG ATGAACTTTT TCAAGGGGGC CCCAACTGGG GCCGCCTTGT 300 

AGCCrrCTTT GTCTTTGGGG CTGCACTGTG TGCTGAGAGT GTCAACAAGG AGATGGAACC 360 

ACTGGTGGGA CAAGTGCAGG AGTGGATGGT GGCCTACCTG GAGACGCGGC TGGCTGACTC 420 

GATCCACAGC AGTGGGGGCT GGTTATCCCA GATCACTGAA GCTGAGATGG CTGATGAAGT 480 

AATTTGCAC3T GAAATTTTAA GCGACTGTGA CTCTGCTGCA AGTTCCXX::AG ATCTTGAGGA 540 

GCTGGAAGCT ATCAAAGCTC GACTCAGGGA GATGGAGGAA GAAGCTGAGA AGCTAAAGGA 600 

GGTACAGAAC GAGGTAGAGA AGCAGATGAA TATGAGTCCA CCTCCAGGCA ATGCTGGCCC 660 

GGTGATCATG TCCATTGAGG AGAAGATGGA GGCTQATGCC CGTTCCATCT ATGTTGGCAA 720 

TGTGGACTAT GGTGCAACAG CAGAAGAGCT GGAAGCTCAC TTTCATGGCT GTGGTTCAGT 780 

CAACCX5TGTT ACCATACTGT GTQACAAATT TAGTGGCXyVT CCCAAAGGGT TTGCGTATAT 840 

AGAGTTCTCA GACAAAGAGT CAGTGAGGAC TTCCTTGGCC TTAGATGAGT CXTCTATTTAG 900 

AGGAAGGCAA ATCAAGGTGA TCCCAAAACG AACCAACAGA CCAGGCATCA GCACAACAGA 960 

CCGGGGTTTT CCACGAGCCC GCTACCX3CGC CCXjGACCACC AACTACAACA GCTCCCGCTC 1020 

TCGATTCTAC"AGTGGTTTTA~ACAGCAGGCC^^ 1080" 

AGCGACATCA TGGTATTCCC C^ACTAAAA AAAGTGTGTA TTAGGAGGAG AGAGAGGAAA 1140 

AAAAGAGGAA AGAAGGAAAA AAAAAAGAAT TAAAAAAAAA AAAAAAAAAA ACAGAAGWTG 1200 

MCCTTGATGG AAAAAAAATA TrmTAAAA AAAAGATATA CTGTGGAAGG GGGGAGAATC 1260 

CCATAACTAA CTGCTGAGGA GGGACCTGCT TTGGGGAGTA GGGGAAGGCC CAGGGARTGG 1320 

GGCAGGGGGC TGCTTATTCA CTCTGGGGAT TCGCCATGGA CACGTCTCAA CTGCGCAACT 1380 

GCTTGCCCAT GTTTCCCTGC CCCACCCCAC CCCTCTTCTC CGGCTCCCTG CCCCTCCAGA 1440 

TTGCCTGGTG ATCTATTTTG TTTCCTTTTG TGTTTCTTTT TCTGTTTTGA GTGTCTTTCT 1500 

TTGCAGGTTT CTGTAGCCGG AAGATCTCCG TTCCGCTCCC AGCGGCTCCA GTGTAAATTC 1560 

CCCTTCCCCC TGGGGAAATG CACTACCTTG OTTTGGGGGG TTTAGGGGTG ' r m'iXJrrX' X ^ 1620 

TCAGTTGTTT TGTTTTTTTG TTTTTTTOTT TTTCCTTTGC CTTTITTCCC TTTTATTTGG 1680 

AGGGAATGGG AGGAAGTGGG AACAGGGAGG TGGGAGGTGG ATTTTGTTTA TTTTTTTAGC 1740 
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TCATTTCCAG GGGTGGGAAT TTTTTTTTAA TATGTGTCAT GAATAAAGTT GTTTTTGAAA 1800 
AKAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1660 
AAAA 1864 



(2) INFORMATION FOR SBQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2041 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



GGCACGAGCG CGCGGCAGGG CCCTGGACCC GCGCGGCTCC CGGGGATGGT GAGCAAGGCG 60 

CTGCTGCGCC TCGTGTCTGC CGTCAACCGC AGGAGGATGA AGCTGCTGCT GGGCATOGCC 120 

TTGCTGGCCT ACGTCGCCTC TGTTTGGGGC AACTTCGTTA ATATGAGGTC TATCCAGGAA 180 

AATGGTGAAC TAAAAATTGA AAGCAAGATT GAAGAGATGG TTGAACCACT AAGAGAGAAA 240 

ATCAGAGATT TAGAAAAAAG CTTTACCCAG AAATACCCAC CAGTAAAGTT TTTATCAGAA 300 

AAGGATCGGA AAAGAATTTT GATAACAGGA GGCGCAGGGT TCGTGGGCTC CCATCTAACT 360 

GACAAACTCA TGATGGACGG CCACGAGGTG ACCGTGGTGG ACAATTTCTT CACGGGCAGG 420 

AAGAGAAACG TGGAGCACTG GATCGGACAT GAGAACTTCG AGTTGATTAA CCACGACGTG 480 

TGGAGCCCCT CTACATCGAG GTTGA CCAGA TATACCATCT GGCATCTCCA GCCTCCCCTC 5AD 

CAAACTACAT GTATAATCCT ATCAAGACAT TAAAGACCAA TACGATTGGG ACATTAAACA 600 

TGTTGGGGCT GGCAAAACGA GTCGGTGCCC GTCTGCTCCT GGCCTCCACA TCGGAGGTGT 660 

ATQGAGATCC TGAAGTCCAC CCTCAAAGTG AGGATTACTG OGGCCACGTG AATCCAATAG 720 

GACCTCGGGC CTGCTACGAT GAAGGCAAAC GTGTTGCAGA GACCATGTGC TATGCCTACA 780 

TGAAGCAGGA AGGCGTGGAA GTGOGAGTGG CCAGAATCTT CAACACCTTT GGGCCACGCA 840 

TGCACATGAA CGATGGGCGA GTAGTCAGCA ACTTCATCCT GCAGGCGCTC CAGGGGGAGC 900 

CACTCACGGT ATACGGATCC GGGTCTCAGA CAAGGGCGTT CCAGTACGTC AGCGATCTAG 960 

TGAATGGCCT CGTGGCTCTC ATGAACAGCA ACGTCAGCAG CCCGGTCAAC CTGGGGAACC 1020 

CAGAAGAACA CACAATCCTA GAATTTGCTC AGTTAATTAA AAACCTTGTT GGTAGCGGAA 1080 

GTGAAATTCA GTTTCTCTCC GAAGCCCAGG ATGACCCACA GAAAAGAAAA CCAGACATCA 1140 

AAAAAGCAAA GCTGATGCTG GGGTGGGAGC CCGTGGTCCC GCTQGAGGAA GGTTTAAACA 1200 

AAGCAATTCA CTACTTCCGT AAAGAACTCG AGTACCAGGC AAATAATCAG TACATCCCCA 1260 
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AACCAAAGCC TGCCAGAATA AAGAAAGGAC GGACTCGCCA 
AGGACACAAG ACTACCATTG TACACTTGAT GGGATGTATT 
GTTTAAAGAA AGACTTTAAC AGGTC3TCATG AAGAACAAAC 
aX3CTTTAATG AAATQGATGT GCCTAAAAGC TCCCCTCAAA 
CACTTTTTGA ATCTCTCTTT TTATGTAAAA TAGCGTAGAT 
GTTTTTTTAT CTTGCTGTGA GAGCATATGT TGTGACTGTC 
GTTTCTTTGT GAAGCTGAAA AGGAACATTA AGCGGGACAA 
AAAAGTGGGT ACTTAATAAA TGAGTCGTTA TACTATGCAT 
nCTCAGGTG GTGGTGCX5CC GGCATTGATT TTAGGGCAGA 
GCTTTATGTT TCTCTTTTAA TTCAGAGTTT TTCCAAGGTC 
GACTTTGAAA TATTCCTGTT GGTCATGATC AAGGATATTT 
CTGCX5TATCT GGGGCGGGGG CAGGTTGGGG GGCACAAAGT 
ATGGTTAAAT ATGCTATTTT AATAAAATAT TGAAACICAC 
A 



CAGCTGAACT 
TTTGGCTTTT 
TGGAATTTCA 
AAACTGCAGA 
GCATCTCTGC 
GTTGACAGTT 
AAAATGCCGA 
AAAGAAAAAT 
TAAAAGAATT 
TACTTTTGAG 
GAAATCACTA 
TAACATATTC 
CAAAAAAAAA 



CCTCACTTTT 

TTCTGAAGCT 
TTTTGCCTTG 
GTATTTTCAA 
TTATTTACTG 
TTTTATTTAT 
CCTAGCAGTA 
CTGTGTGAGA 
TTGCAAACTT 
CTGTGTTTTG 
TTGGTTAACC 
AAAAAAAAAA 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2041 
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(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 2012 bcise pairs 
"TBT ^TiPEz nucleic" acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TACCAAGCTG CAAGAATCTA CTATATCATG GCAGAAGAAG 
GACCGGAGCT GGGAACGGGA ATGGCACAAC CAGTCTGAGA 
TTCCTGAGCA ACAAGGATGG GCTCCTGGGT TCCAGATACA 
TACACTGATG GTACATTCAG GNTCCCTCGG CCAAGGACTG 
ATCTTGGGTC CACTTATCAA AGGTGAAGTT GGTGATATCC 
AATGCCAGCC GCCCCTACTC TGTGCATGCT CATGGAGTGC 
CCACTGGCTG CTGAGCCTGG TGAGGTGGTC ACTTATCAGT 
GGCCCTGGGC CAATGACTCT GCTTGTGTTT CCTGGATCTA 
TCAAGGACAT GTATAGTGGC CTGGTGGGGC CCTTGGCTAT 



132: 

TAGAGTGGGA CTATTGCCCT 60 

AGGACAGTITA TGGTTACATT 120 

AGAAAGCTGT ATTCAGGGAA 180 

GACCAGAAGA ACACTTGGGA 240 

TGACTGTGGT ATTCAAGAAT 300 

TAGAATCTAC TACTGTCTQG 360 

GGAACATCCC AGAGAGGTCT 420 

TTATTCTGCA GTGGATCCCA 480 

CTGCCAAAAG GGCATCCTGG 540 
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NAGCXXXIATG GAGGACGGAN TGACATGGAT OGGGAATTTG CATTGTTGTT CTTGATTTTT 600 

GATGAAAATA AGTCTTGGTA TTTQGAGGAA AATGTGGCAA CXX:ATGQC3TC CCAGGATCCA 660 

GGCAGTATTA ACCTACAGGA TGAAACTTTC TTGGAGAGCA ATAAAAOXSCA TGCAATCAAT 720 

GGGAAACTCT ATGCCAACCT TAGQGGTCTT ACCATGTACC AAGGAGAAOG AGTGGCCTGG 780 

TACATGCTGG CCATGGGCCA AGATGTGGAT CTACACACCA TCCACTTTCA TGCAGAGAGC 840 

TTCCTCTATC GGAATGGCGA GAACTACCGG GCAGATGTGG TGGATCTGTT CCCAGGGACT 900 

TTTGAGGTTG TGGAGATGGT GGCCAGCAAC CCTGGGACAT GGCTGATGCA CTGCCATCTC 960 

15 ACTGACCATG TCCATGCTGG CATGGAGACC CTCTrCACTG TTTTTTCTCG AACAGAACAC 1020 

TTAAGCCCTC TCACCGTCAT CACCAAAGAG ACTGAAAAAG CAGTGCCCCC CAGAGACATT 1080 

GAAGAAGGCA ATGTGAAGAT GCTQGGCATG CAGATCCCCA TAAAGAATGT TGAGATGCTG 1140 

20 

GCCTCTGTTT TGGTTGCCAT TAGTGTCACC CTTCTGCTCG TTGTTCTGGC TCTTGGTGGA 1200 

GTGGTTTGGT ACCAACATCG ACAGAGAAAG CTACXWXXZA ATAGGAGGTC CATCCTGGAT 1260 

25 GACAGCTTCA AGCTTCTGTC TTTCAAACAG TAACATCTGG AGCCTGGAGA TATCCTCAGG 1320 

AAGCACATCT GTAGTGCACT CXXIAGCAGGC CATGGACTAG TCACTAACCC CACACTCAAA 1380 

GGGGCATGGG TGGTGGAGAA GCAGAAGGAG CAATCAAGCT TATCTGGATA TTTCTTTCTT 1440 

30 

TATTTATTTT ACATGGAAAT AATATGATTT CACTTTTTCT TTAGTTTCTT TGCTCTACGT 1500 

GGGCACCTGG CACTAAGGGA GTACCTTATT ATCCTACATC GCAAATTTCA ACAGCTACAT 1560 

35 TATATTTCCT TCTGACACTT GGAAGGTATT GAAATTTCTA GAAATGTATC CTTCTCACAA 1620 



40 



AGTAGAGACC AAGAGAAAAA CTCATTGATT GGGTTTCTAC TTCTTTCAAG GACTCAGGAA 1680 

ATTTCACTTT GAACTGAGGC CAAGTGAGCT GTTAAGATAA CCCACACTTA AACTAAAGGC 1740 

TAAGAATATA GGCTTGATGG GAAATTGAAG GTAGGCTGAG TATTOGGAAT CXAAATTGAA 1800 

TTTTGATTCT CCTTGGCAGT GAACTACTTT GAAGAAGTGG TCAATGGGTT GTTGCTGCCA 1860 

45 TGAGCATGTA CAACCTCTGG AGCTAGAAGC TCCTCAGGAA AGCCAGTTCT CXIAAGTTCTT 1920 

AACCTGTGGC ACTGAAAGGA ATGTTGAGTT ACCTCTTCAT GTTTTAGACA GCAAACCCTA 1980 

TCCATTAAAG TACTTGTTAG AACACTGAAA AA 2012 

50 



<2) INFORMATION FOR SEQ ID NO: 133: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base, pairs 

(B) TYPE: nucleic acid 
(C> STRANDEE»IESS: doxible 

60 (D) TOPOIjOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

GAGCAGTATT TTAACCAACT TGTATTACAG ATGTTACAGT TCATGTTAGG AAGTCAGAAA 60 

5 

AGACTTTGTT TGTCTTTGTT CTGCTGATGT GAGTCATOTT TTGTQGGGTC TTCCATGGCA 120 

CATTTACCTG TTGCTCCGTC CAGATGTTGA GGGCCAGTCT AGGCTGACAC ATCCTACCCG 180 

10 AGGACAAGCC TGTTCTCCAT TTCTTCACTC TCCCCTCCCC ATATAGCAAC TCTCCCAGGT 240 

TTAGATTACC GTTTTCGACG ACAGATTAAC CAAAAATGCC CCACACAGGT TTTATTACTG 300 

TTATATACTA TACTTTTAAC AGTACAGACC CTAAATTTTA TTAnTCTTG CTCCCCCAAT 360 

15 

CTGATACCAA ATGTTTAAAG TTGTTTGAAA TCCAAACATG GTAGTGTTCA TGGGTAAATA 420 

TTTTCTAGGC TATGTAAGAG TTAGCAGCCC ATAGCATAGA AGTAATCAAG TAGCATCTGA 480 

20 GACTGTTGGA GGCACTAGGG CCTCTCTGGG CCTAACAGCC TCACTTCCCC AGCCTCACCT 540 

TGCTGTCCTC TGT^CTGCC ATCAGGGCTG TTAGTGGCAC CTGTATGAGG CCAAGTGTGC 600 

GTCCAGGGGA ACAGCACAGG TTAATGCGTC TCCCTAGAAC TCATGAAGTC AGTTTAATTC 660 

25 

ATGCATGAAC ATGAGTTCAT TTTATGTTTT ATATAGCTTT CTTAGACATA CCAAACCATC 720 

ATTCATAAAT CAGATAAATT ATTCAGTTTT TGTGTTTAGA AAGCTAAGTA TGTGTAGCTG 780 

30 GAAACAAAAA TGAGCGTGTT TTCTCTCCTG TTAATCTAGA GTGTGCAGTT ACACATGTGT 840 

GGATAATTTC ATGTTCCAGG GGCGCTTGGC ATCTCCCATG GACTGATTCC CAGGAAGAAA 900 

AGCCCAAAGG GAAACCCACG ATTCCTTTCG AGTAGATGTG GGAAAGAGCC CATTGGAGGA 960 

35 

TATGAGGTCC TGTGAAATTC AGTTGTGTGT GTGGCTCCTT GTTAGCAGTC ATGTTGACAT 1020 

GGTGTTAGGA GGCTCCCCAT CCACCCTTTA CATGATGTAG GGACCAGTGT CTTGTGA6AT 1080 

40 TAACCTTGGG ACACAGTGGG TTAGCCTGGA GAAAATGAGA GGCCCTGCCT GGACCCAGGG 1140 

AGAGGAGCCA GTGACACAGG CAGAGCGGTG CAGCCCTCCT TCCCTTCCAT TTGGAGGAGG 1200 

TGGTGCCAGG AGCCTGCCCG CTTACCTCTG CTGAAGCATA AGTGGACTTT GCTTTTGGGG 1260 

45 

CTTATCTCTG ATACATGCTG GAGCCCTGCC TCTCCACTGC TAGATGGAAC CTGGAATCTC 1320 

TCATCTACCT CTTAGTCTGT CAGTTTCTAC GTGTGAGAAG CAAGCTTGTG GGCCAGTGTC 1380 

50 CTTGTACATG CTGTAGCACT TAAAAAATAA TTCCAGGGTT CCCTGGAAAA CCAGTCCCAG 1440 

GGTTCCTATG ATCTGTAGTT TCTACCTGGA TTATAACTGG TTTrGGGTAC CTGAATTTTG 1500 

ATTGGTTAGC CTTAATTATA GTCTGGCGTG ATCATGTAGA ATCTTTTCTG GTGAACAGAT 1560 

CATAAAGTTC TATCAAGGAG TTCTATCAAG GCATCCATGT CAGTGGTGCT ATGCTGGTTA 1620 

CAACTTGAGA TTTTTGAAAT AAAAAATTTG TCATAAAAAA AAAAAAAAA 1669 

60 
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(2) INFORMATION FOR SBQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 1565 bcise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CACTTTTGCT ATATAACCTA AGTGATAACC CTCTTTTAGT TACCTGCCAA ACTCTGGNCT 60 

15 TGGTTTATAT TGCAGTTAAC ACAGTTACAA AGCTGTAATG GTGTCTTTTT TTCCTTTGTA " 120 

ACGGAATGTG TAAATCAAAG TATATACATT GTGTGGTGTT CCTGTTTCTG GAGTTTCATG 180 

AGGATTTACA CATGGCATTC AGTGTTCTGT ATAGATCTGC CTACCTTTGT GAATTCATCT 240 

20 

GTTAACCCCT CTTCCTTTGA GAGAGCACCG GCGATGGTGG TTAACTCCTT GTGTTTTCTC 300 

TCTCTCCTAC TGGTTATTCT TGAATTAAGC ACAGACTCGT CAGCTCGGTT GCTTTATCAT 360 

25 GAATAATGTG TGTGACCTTG CAGTrCTTCC ACAGTTCAGC AAACAAGTGC TAGCTTCACT 420 

GAGCAAAAAT TAAGGAAGGA AAACACAGTT TTTAAAACGA TCCATCTTTT AACAGCCGAA 480 

ACCGATGTGT CTATGGTGCT GCACCTTGCT GTTGTACTTC TGAAATCAGA CGTGTGTGAA 540 

30 

CGATCATTTC TGACTTAACC GTGAGATGCT CACGAGTACC CTTCCTGTTG TTTTGTTAGC 600 

ATTGAAATCG AGACTATTTA TTTGGAATAT ATACAACAGT GTTTTTCCAC TGTATTTCAT 660 

35 TTGCAAAAGT TGAGAACTGC TTTCTCTACC TTTTGCAAAA TAATTGATAT TCCATATTGG 720 



ATTCTCAAAG ACTTCGATAT GGTGAACCTA TTAAACCTAG AAATTGTATT CATCCTTTCA 780 

TGACTGTGGC CTGAGTTCCC CAGCCCCTCT CCTCCTTTTT TTTAGATGAG ATTTAGCACA 840 

CTCTCAGTTA TTTAAACATG CAACATTTCT TGAGTATGTA TGTTGAGGCC ATCTGAGCTC 900 

ATAGCTGATT CAGTAACCAG TTTCATGCTG TGTCATTCAC ACTCACTACT TAATACTGCC 960 

45 ATGGTGAAAA TGTGGAGGAA AAATGTATCC ATGTGTGTCT GGGAAGCATA TACACTTGTA 1020 

CATTTTTTAA TACTCTGATT CTGTAACATT TCTGAGTTTT GTTTTGTTTT ACAGNAAAAA 1080 

AAAAAAAAGT GATAAAGCAA TCAGAAGACC AAGAGGTTTA CTATTGATGC TTAGGGTCGT 1140 

50 

CTGACCTTGG CTQGCCAATA GACCTACACG GCCAAATTAA TTTACGAGAG TAATAATTTT 1200 

TCAAAAGCCA ATTTTTTTTC TGTATTTTCT GTATGAAACT GCCAATATCA TGAATAGAAA 1260 

55 GGGAGAACCA TAAAGGAGAA AGAACGTGAT GTTCTGTTAT GTTCATGTAA ACCTAAAGAA 1320 

ACAGTGTGGA GGCAGGCGCG ATCAGCCGAA CTCTAGGGAC TTGGTGTTGC TTGGAAGGCA 1380 

TCCATACCTG CATTTTGCAT TCTTCGTATG TAATCATATT GCCAAAGACA AACTATTTCA 1440 

60 



wo 98/39448 



PCT/US98/04493 



359 

TCATTTATTG TAAATAACAC TTTTCCCCAG ACCTACCATA AAGTTTCTGT GATCTATTGT 1500 

CTTCCAGTTG CAATAAAAAT TACTGAGTTG CATCAATTGA AGAAAAAAAA AAAAAAAAAA 1560 

5 CTCGA 1565 

10 (2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2007 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEEB^ESS : double 

(D) TOPOLOGY: linear 

. <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

20 TCTAAAAGCC CCCTTATACC CCACTTTGTG CAGCAAAGAT CCCCGTGCAG GTCACAGCCT 60 

GATTTGTGGC CAGGCTGGAC AAATTCCTGA GGCACAACTT GGCTTCAGTT CAGATTTCAA 120 

GCTGTGTTGG TGTTGGGACC AGCAGAAGGC AAACGTCCAG CCAACACACA GGACTGTAAG 180 

25 

AGGACTCTGA GCTACGTGCC CTGTGAAGAC CCCCAGGCTT TGTCATAGGA GGTCGTTCAG 240 

CTTCCCCAAA GTCAGAGGTG ATTTGATTTG GGGAAGACTG AATATTCACA CCTAAGT03T 300 

30 GAGCATATCC TGAGTTTTAC TTCCTTATGG CTTGCCCTCC AAGTTCTCTC TCTCATACAC 360 

ACACACACCC TTGCTCCAGA ATCACCAGAC ACCTCCATGG CTCCAGCTAT GGGAACAGCT 420 

GCATTGGGGC TGCCTTTCTG TTTGGCTTAG GAACTTCTGT GCTTCTTGTG GCTCCACTCG 480 

35 

CGAGGCAGCT CGGAGGTGTG GACTCCGATT GGGCTGCAGG CAGCTCTGGG ACGGCACAGG 540 

GCGGGCGCTC TGATCAGCTC GTGTAAAACA CACCGTCTTC TTGGCCTCCT GGCAGTTCTT 600 

40 TCTGOGAATA GTCCTCTCCC TGGCCAGTTG AATGGGGGAA GCTGCTGGCA CAGGAAGGAG 660 

AGGCGATCCC GGCTGAGGCT TAGGAAATTG CTGGAGCCGG CTCCAAGCAG ATAATTCACT 720 

GGGGAGGTTT TCAGAGTCAA ACATCATTCT GCCTGTKTTG GGGGCCAGGT GTGTCACACA 780 

45 

AGCATCTCAA AGTCAAAAGC CATCTGGGGC TGCTGCTTCT CTTTCTCAQG CTCTGGGGAA 840 

AGGAATCTCC CTCTCCTCTC ACTTGATTCC AAGTGTGGTT GAATTGTCTG GAGCACTGQG 900 

50 ACTTTTTTTC TCTTTTCCTT GATGGACCAA CAGTGCAAAT GCAATCTCGC CATTTAACTT 960 

TCAGGTCGAT TTCCTTTCCT GATCAGACAT CTTTGTGCCC CCTTTAGGAA GGAAAAGAAT 1020 

ACACCTACGA TGTGCCAGGC ACTGTGTTAG GCGCTTTTAT ATAGATCCTC GTTAGGATGA 1080 

55 

GACTAAGGGA TGAGGACATC TCTTTATAAA AGGCCCCTAA GTAATGGATA AACAGAAACA 1140 

CTTAGAGGTG AGAAGGTCTG TCTTCAAGAT CCAAGGTAAG ATTGCCTTCA GTCTGATGTT 1200 
60 TGTTCTCAAG GACTTATCCC CTACAATATT CTCCCACTCC ATACTTCTCC TTCTACCCCA . 1260 
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CCATGTGCTC CCX3TGCACTC CTCAGATGGT CAGAGGGGTA ACCCAAGTCC TTAGAGAATT 1320 

TGGGGACXrAA TAGAATATGT GATGTCTGAA TTTTCTTTAA AAAACTTAAG GAGTCTTTGC 1380 

TACCTTCTGC TTCTTGAGTT GTTTTGGCAT TCATATTAAA AGCCAGCATC TCACTATTTA 1440 

TTGACAQGTT GGGCTGTC3TG TGTGCGCATG TGTGTATACA TTTCCAGGCG TCCCTGTXSTC 1500 

CTGTAGCTTT TTAAAAGGAA ACCCAC3TCAT CCCACTATGA ATCTGGCATC TTCTTATGCT 1560 

TCTAGTCTTT TGGCCATACA TCAACCAAGG GGTTTAATTT ATCCAATGCT TGACGACATG 1620 

TTCAGGAGGG GCTGGATCAA ATTTTGAGAG GGTTATGGGA AAGGGAGGGG GAGAAGAAAT 1680 

TGACATTTAT TTTATTATTT ATTTTAAATG TTTACATCTT CTTTATGTTG TATCAAGCCT 1740 

GAATAGAAAC TGATAGCATT AAAATACTCC GTOCCTCTCT CTCTTCTCGC TTCCTTTTTT 1800 

TTTTTTTTTA AATTTAGGAT AACACATTTT TGTTTCTAAA GTGATTTGTG ATTTGTGCTG 1860 

TATAAACTGT ATAAAAGGTT CTGTTTTTAA AGGTGGATTT TCATTCCTCT GGGGACAGTG 1920 

GTCGCCAAGA CATCTACATT GTAAGAGAAC ACAGTGGAAG ATCCTGTCCT GATTCTCAAA 1980 

AATTATTTTC TCTGTATGAT TAAAAGT 2007 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

. iD)-TOPOLOGY^:- linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CTTTTAACCC TCCCCCTTCA CACACATACA TATCAGGTTG TTTTCTAGTT AAAAACCCAA 60 

GTAGCTCAGA TTCTACTTTA ATGTCAGTGC AGATTTGCAT TGAATCATGC CATTATGTTT 120 

TTTCTCATTT TTATGCTGTT GGGTCTTAGT TTTTAAATTG ATATAAAGAA CTCAGCAATG 180 

GTTTTATTTT CTACTCATAC TTAGGGTTTA GGAAACACTA CCACTAGTTA TCATTTAATC 240 

AACTTCAATG GTCTACTGAA ACAAAAATGG TAACTTTTCA TTAGTGGATT ATTTAGAGTT 300 

ATAGTAGTTG TTTCCAGAAA ACACTTCCTC ACAATTGTAC TTCCCAATCA AATCATGTGA 360 

TCATACAGTT ATTCCCATGA AAGGCAGAAT GTTTGTTTCA AAATTAATCT AGTTTTCTGT 420 

ACATTTAAAT TTGAGAAGGT GACAACTGGC TCTTTTCCAG TCTTCCTTCA TGTCAGTTTT 480 

CTGATAGACC ACTATTGGCA AACACTATCT GTCAACTACC AAATGTGTAA AATTTTCTGT 540 

ATTTCACTTT GTCTTATTTG TAAATAGTGA ACTAAAACTT TTGGCAGATC AGCAACATTT 600 
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GCTGAGCCTG TTTTTTAAGC TAATGTGTAT TCTTACTAAT GTTCCTATCA AGAATGGATT 660 

TGTAATATAT GCTGTCTATT TCTAATGTTC ACATTCATAT TTTGAGGTTC TATCTTATTT 720 

TAATAGAGAA CAGACTTCTC AAAAAATCTT CAGAAGCAGC TTATTATTCA AATATCGAAA 780 

TATTGAAATA AACCCGGTGG GTTAGATTAC TCATCTGTCC ACCAAGTOGG ACATTTCCAT 840 

GGACTGGGGG CTTAAAGGAC TTAGAAGAGA CCTGTAAGTA AATCCTGAAA ATGAGCCAAT 900 

CCCCACTTGA ATGGTTACTG GAGTAAACCC ACCTTTACCA CCCCAATTAC AGCACCXX3AG 960 

GCCGATAAAC CAACTTGGCT CTCGTTCATT TTTCTTTTCT TCATTTGTOA TCCTCAGATT 1020 
CAAAATGTGT GTTCTACACT GTTACAGGCT TCTLTiTlCT TTGATTAAAG ATTTTAGTCC " 1080 

TACTTTTGTA TGGACACATT AGAATATTCA GAGACCAAAA TAGAAGAATT TGCTOTTAGA 1140 

TATTTTTCAG AAGTCAGCAG ATTTGTGGCA AATCATTTAT TTGCCTTTTT AAAAATTCAT 1200 

TTAAGCAGTT CAGAGAGTAG ACTACTCAGA AAATTATTTC ACX5TAATrGT CTAAGAGGTC 1260 

AATATTTTTT AATGCATATT GAATCAAATA A 1291 
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(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GGCACGAGGA CCTACTTTTG TAACAGACCA TGGTTGTGTC 
ATTTTTGGAT GCTTTGTCTG CAATCTTGAC TTGTTTTTGC 
AAATTGTGAA TCTTTTAAAC ATCTTGATAA TTTGTTGTTG 
GTAATGAAAT TCAGTCTAGT TCTGCTGATA AAGATCATCA 
TTCCTCTTCC CTCTTAGTTT TTTACCCAAT ATATGGAGAA 
ATTTTGTTTT AATTGTTTAA TAAAGCTGCT GGGCAGTGGT 
CATAAAAGCA AAATACTTAC ATAGCTTTCT TAAAATATAG 
GAGAAAGTAA GTTGCTTTGC ACCGCCTACT TAATTCCTTT 
TTTTGAATAT GGAATCTTAC TATTTGAATA GAAATGTGTA 
ATAAGCATAT ATGTGTGTGT GTGTGTGTAT ATATATATAT 
CTACACAACA TAAATCACTT TTTAAATTCC AGGAACGGGT 
CTTTTGAGGC TGAATCCGTT ATTAACTTGT TATTTAGGTT 



137: 

CAAGGTAAAA CCACAGTGAT 
AGTATCATTA TTCAGACTTC 
AGAGCTGTTC ATTCTAAAAT 
GTTTTGAAAG GTTACTGATT 
GAGTAATGGT CAATCTTAAC 
GCAGCATTCC TACCTAGTGT 
GAATGACATT ACATTTTTAG 
CCATATATTG TGATACAAAC 
TGTATAATAT ACATACATAC 
ATGCATGCTG TGAAACTTGA 
AGTCTGACAC GGTGATTATC 
TTACTCCCAG TAGCAAGGGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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TTCTAAGTTA GTTGCACTTA CATGATTATT GTGATTTAAA ACTAAGAATA AAGGCTGCAT 780 

TTTCAAAGAT. AAATTGGAAT TGCTGTTGGT GAAATAACAA CCAAAATACT GAATCTGATG 840 

5 

TACATACAGG TTTCTACAGG AAGAGATGGT ATAATTTACA ATTTGGAGAT TTAATAAOCA 900 

GGGCTACCCA GAAAAAGTGA CTTGATAACA TGGTACCAAT AAGTAAQGGA TGCTCTCTCG 960 

. 10 GTTTGCTTTT GCCACTTTCA AGATTTTAAC TTCTCAGGTT ATTAATCAAA ATTATTGTAT 1020 

AAGTTAGCCA ATAGAATTTT TAGGTTAAAA CAACAGATGG GGGGTTTGTG GAGTGTTTAA 1080 

TGTCATGGGC ATTTTTAGTA GCATAGACCC TTTGTTCTGC ATTTGAATGT TTCGTATATT 1140 

15 

TTTGTTTCAC AGTrAATCTT CCCTCCCCAA GTTTGCTATT CAAATCAACT GCCTGAATGA 1200 

CATTTCTAGT AGTCTGATGT ATTTTTCTGA GGAATAGTTT GTGATTCCAA TGCAGGTGTC 1260 

20 TTCATTACCA TTACCTCTAC ACTGCAGAAG AAGCAAAACT CCTTTATTAG AATTACTGCA 1320 

CATGTOTATG GGGAAAATAG TTCTGAAAGG CTAGAATGAT ACAACTTGAGC AAAAGTTGGT 1380 

CAGCTTGGCT ATQGAGTQGT GGCAATAATC TCTAAACATT CXAAAAGACC ATGAGCTGAA 1440 

25 

CXrrAAACTCC CTTGGGAATC TGGAACAAAG GAATATGAAA ATTGCX:ATTT GAAAACTGAC 1500 

CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCXZATTT CAAGTACAGA 1560 

30 AATTATAGAG ACTACAGCTA AATAAATTTG AACATTAAAT ATAATTTTAC CACTTTTTGr 1620 

CTTTATAAGC ATATTTGrTAA ACTCAGAACT GAGCAGAAGT GACTTTACTT TCTCAAGTTT 1680 

GATACTGAGT TGACTGTTCC CTTATCCCTC ACCCTTCCCC •TrCCCTTTCC TAAGGCAATA 1740 

35 

• GTGCACAACT TAGGTTATTT TTGCTTCCGA ATTTGAATGA AAAACTTAAT GCCATGGATT 1800 

' lTl ' riC '' X ' ri " i ' GCAAGACACC TGTTTATCAT CTTGTTTAAA TGTAAATGTC CCCTTATGCT 1860 

40 irTGAAATAA ATTTCCTTTT GTAAAAAAAA AAAAAAAAAA AAAAAA 1906 



45 (2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
55 TCTGAACTAA TGCTAACAGA TCCCCCTGAG GGAITCTTGA TGGGCTGAGC AGCTGGCTGG 60 
AGCTAGTACT GACTGACATT CATTGTGATG AGGGCAGCTT TCTGGTACAG GATTCTAAGC 120 
TCTATGTTTT ATATACATTT TCATCTGTAC TTGCACCTCA CTTTACACAA GAGGAAACTA 180 

60 
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4.>.7wwu^xAn i.\Ajrt.i\-.o*^ IVAMI^i^vAL. T. 1 ACiCiTAAGT TGGCAAGTCC 


ATGCTTCCCA 


240 




^-xw«j**i.\,\_x 1 AGTCTACTTC TCTC3CCTATT TTGTATACTC 


TCTTTAATAT 


300 


5 


vai.vjs-^*i«ijK„i 1 iij^aAAAIj i\_ UAGAATGGGT CCCTGGTGCi TTTTTACTTT 


GAAGAAATCA 


360 




J, \.i 1 1 1 AA1jcAAAAL.AA ALj l\jCAA'l'i\5 i 1 1 1 1 iACtG 


GAAAGTTACC 


420 


10 


AiJ(al\3AAi.-AG GACGTAGTTN AGGCCTTCCT GTAAACAGAA 


AATCATATCA 


480 


A x\-v,wii.\, i\jf 1 i i\_i\jAAiv» tX. 1 \jCT ACTIT CTTGTAGATA 


TTTCATTTCA 


540 




ALsi iAAALXX; GTGGATTTTG TAGTTAGGAA CCTC5GGKTCA 


AACCCTCTTC 


600 




(_A(-iAA1TGG CTTATOTCtvtI' QiUACAAGTTl* 'I'l^l'l'lTl'l'lT Tl'ri'lTTTAA 


ACCCTTTCTG 


660 




AACTTTCACT TTCTATGTCT ACX^TCAAAGA ATTGTTGTGA GGCTTGAGAT 


AATGCATTTG 


720 


20 


TAAA(3GGTCT GCCAGATAGG AAGATGCTAG TTATGGATTT ACAAGGTTOT 


TAAGGCTGTA 


780 


AiaAL»i\,iAAA ACCTACAGTG AATCACAATG CATTTACCCC CACTGACTTG 


GACATAAGTG 


840 




AAAACTAGCC AGAAGTCTCT TTTTCAAATT ACTTACAGGT TATTCAATAT 


AAAATTTTTG 


900 




TAATGGATAA TCTTATTTAT CTAAACTAAA GCTTCCTGTT TATACACACT 


CCTGTTATTC 


960 




TQGGATAAGA TAAATGACCA CAGTACCTTA ATTTCTAGGT GGGTGCCTGT 


GATGGTTCAT 


1020 


30 


TGTAGGTAAG GACATTTTCT YTTTTTCAGC AGCTGTGTAG GTCCAGAGCC 


TCTGGGAGAG 


1080 


GAGGGGGGTA GCATGCACCC AGCAGGQGAC TGAACTGGGA AACTCAAGGT TCmTrACT 


1140 




GTGGOGTAGT GAGCTGCCTT TCTGTGATCG GTTTCCXrTAG GGATGTTCCT 


GTTCCCCTCC 


1200 


35 


TTGCTATTCG CAGCTACATA C7VACGTGGCC AACCCCAGTA GGCTGATCCT 


ATATATGATC 


1260 




_ AGTCCrGGTC_ ATAGCCCCAC- CXIAAGCTGGG^ -TATAGGTTTA-^ - - 


- 1320" 


40 


AATTAGGCAA CCTAAAATAT TGATGCTGGT GTTGGTGTGA CATAATGCTA 


TGGCCAGAAC 


1380 




TGAAACTTAG AGTTATAATT CATGTATTAG GGTTCTCCAG AGGGACAGAA 


TTAGTAGGAT 


1440 




ATATGTATAT ATGAAAGGGA GGTTATTAGG GAGAACTGGC TCCCACAGTr 


AGAAGGCGAA 


1500 


45 


GTCGCACAAT AGGCCGTCTG CAAGCTGGGT TAGAGAGAAG CCAGTAGTCG 


CTCAGCXTTGA 


1560 




GTTCAAAAAC CTCAAAACTG GGGAAGCTGA CAGTCCAGCC AGCCTTCACT 


CTGTGGCCAA 


1620 


50 


AGGCCAAGAG CCCCTGGCAA CCAACCCACT GGTGCAAGTC CTAGATTCCA 


AAGGCTGAAG 


1680 


AACCTGGAGT CTGATGTCCA AGAGCAGGAA GAGTGGAAGA AAGCCAGAAG 


ACTCAGCAAA 


1740 




CAAGGTAGAC AGTGTCTACC ACCAYAGTGG CCATACCAAA GAGGCTACCG 


ATTCCTTCCT 


1800 


55 


GCTACCTGGA TCCCTGAAGT TGCCCTGGTC TCTGCACCTT CTAAACCTAG 


TTCTTAAGAG 


1860 




CTTTCCATTA CATGAGCTGT CTCAAAGCCC TCCAATWAAT TCTCAGTCTA 


AGYTTCAAAA 


1920 




AAAAAAAAAA AAAAA 




1935 



60 
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(2) INFORMATION FOR SEQ ID NO: 139: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
NGCCCCCTTG GCACAAGTCA GATGAAGCAC GTTCTGCCGG GGAGGCCCTC AMCTTCCAGA 
GAGGACAGAC ACAGATTTCC TGCTGGGGGA GGGAGGAGTC CACGCATCCT GATGCTGCCT 
GGAAGCTTAT TTTCCCGTGG CCAGGATGCA TTTCTCTGAG TGGAAACAGG TTCTTGCATG 
TGGATGTGTG TTTCCCCAGG CAGACGGCCC CTCTYTTCCC AGCACTTCCC TGCCTCCCCC 
AGGCCTCAGG CCAGCACCCA GTTCCTCCTC ACATGGCAGG TGAGCACAGA CTTCTAGTTG 
GCAGGAGCTG AGGAGGGTGA ACAAACCCCG AGGGAGGCCC GGCCCTTGCT COCGAGTTGG 
GGGGAGGGGG TGTGGCAACG TGCCCCCCGC AGAGGCCACG CATGTTTGAC CAAAGCCCTC 
ATTGTGGTCC GAGGACAGCC TTTTCCCCAG GCCTCARAGC ATTGCTCATC CGTGCCAAAC 
TGGGTAGGTG GATTTGAGCG GAAAGACTCC CAAAATGTGC CAAGAATTTC CCRGTCCCAG 
GCAGGGCAGG GGAAACTAAG GGCAAGCAGG ATACAGGGCG AGGGATGTGG CAGGTGAGGG 
GGCTCCCGCC TGTGCCCCTT CTCCTCACCA TGTCTCCCCC ACCCTGCCTC AGTTCTCCGT 
TCCCCTTCAT CTCCGTCCCC CTCTTTGAAG CTGTCCCCAT CTCAGTGTCA GACCAGCCTT 



CTCCTCAKCT GACCACCCTC CTCTGACCSA CGCCCCCTCC TTGTCTGAAA AAAGGAGCCT 
TGAATGGTGG AGGGAGGCAG TGGGGAGAAA GGTCTCACCG GACAGGTTGG GAGAATGAGG 
TCAGOGGTGC TGGGGAACAG ATGGAGGGGG CAGTGGGGAC AGGGCTTGQG CAGACACCAG 
CAGGAATAAT TTGAAATGTG TGAGGTGACT CCCCGGAGGC CTTGGGCTTG GGCATTTGGG 
AAAAGAATGA TGTCTGGAAG GGCTTAAGGG ACACAGTGGA CGAGGGGAGA GTCCTCATCT 
GCTGGCATTT TGTGGGGTGT TAGTGCCAAA CTTGAATAGG GGCTGGGGTG CTGTCTTCCA 
CTGACACCCA AATCCAGAAT CCCTGGTCTT GAGTCCCCAG AACTTTGCCT CTTGACTGTC 
CCTTCTCTTC CTACCTCCAT CCATGGAAAA TTAGTTATTT TCTGATCdT TCCCCTGCCT 
GGTCTAGCTC CTCTCCAAAC AGCCATGCCC TCCAAATGCT AGAGACCTGG GCCCTGAACC 
CTGTAGACAG ATGCCCTCAG AATTGGGGCA TGGGAGGGGG GSTGGGGGAC CCCATGATTC 
AGCCACGGAC TCCAATGCCC AGCTCCTCTC CCCAAAACAA TCCCGACAAT CCCTTATCCC 
TACCCCAACC dTTGCGGCT CTGTACACAT TTTTAAACCT GGCAAAAGAT GAAGAGAATA 



60 
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1020 
1080 
1140 
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1380 
1440 
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(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC3TH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TTTTTTTTTT TTTGATATGA AATTGTCTTT CTCCATTGCA GAAATAAGCT AGGGAAACAC 60 

TAACCCAAAA ACTTTCTGTA GAGCTGTTCC TTTGGAGGCA GCATCACTTA TTCGCAGTAA 120 

AGACTCAGTA TAAAAGCACC AGCATCCCTA CTTGGGTGAT GGGGATTAAT TTTATAGCAT 180 

TCCATTTTCC TAGTGCCACA TGTGAAATTG GATTTTGATG ATCTTAATCT ATATTCTACC 240 

CTTATAATAA AAGATCAAAA GATATATCTC CTATGAACAG ATTGGAGATA GGAGATGAAA 300 

AGTTGGGAGG ATGTCTTTAT TCTAATGTGA GGGTAGGGAA AATGTGGATA ACATTACTOG 360 

GGTGARGGAG GCATTGTTCT TTAGTTGGAG 1TCTCATTTT TATTCTCCAG TACTGACTTC 420 

TGGGGAAAGC ATACTTTTTC ACTGCCAGGT ACTGAATGCA GAGGCTCAGT GAAGTATATA 480 

TGTGGGAAGT GCATGCATTT CGTTTATTAG CAAACATAGC TGGATTAAGA CAAAGTTGTT 540 

GGTTTGGAAA GGGGTTAAAG CCTTAAGTGA ACAAATCTAG CTAACAGTGA ATGAACTAGG 600 

,TAATATAAGT- TGGATMPFTT— TAATTTGGTT-TGGTTAAAGG-Tee^ ^gO ' 

GGAGACATGA GAAGTATGAT TACTTCAGTG TTAGTTTTCT TAATTTTTTT TTTCCCCTAT 720 

TTGTCCCTTG TCACTTTGTT GCAAGCTAGA AATCTGTGGG TTATACATAG GGCAGCTCTT 780 

TGTGAAAGTG GTTTATTCCA CTGGAGAAAG GGGATTGAAA ATCAGTTAGA ACCAATGTAT 840 

TTCTTGCCCC ACGGAACACT ATTCCTATAA GATAGCTGAA AGAAGCTGCT GTGAGGAGCT 900 

CAGCTCCAAA CACAGGATCA GCACCTTGTA TAGGAATTCC CATGAATTAT GACTTCTCAT 960 

TCTGTTTTAT CAGAGTGCAT ATATGTCCTA CTTCAGGAAA AGTAAAACAG TCATTTACGA 1020 

AAGAAAGTCA ATCTGTATCC TAAGCATTTT AATAAAAAGT TAAAACAAAA AATTAAAAGG 1080 

GACACTCGAG GGGGGGCCCG AAACCCAAT 1109 



60 



{2) INFORMATION FOR SEQ ID NO: 141: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEC3NESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEWENCE DESCRIPTION: SEQ ID NO: 141: 
TAGGACTAAC TTAAATTCTT TTATTCATCT TTTATTTATT AAAAAATTTT ATTTCTTTGA 60 
10 ATTTTCCTGT AATTTCCTTA RGCTCTTCTA TAAAATGTTA TATTCATGTG AACCATACCT 120 
CATTATCCTT AACATTTACT CTCAAAAAGC TTTTTATTTT TATTTTTTTG AAGGTAGTTT 180 
TTCTGTGTGT ACTCTGTAAC ATGATTTTGC TTTCAAATCA TTGTTGTGCC CCCATACAAA 240 

15 

ATGCCTTTTA TTTTTGAGGA TCX3TGGACTT TTTAGTATGG CATGAGTGTG CTAAAAGCCA 300 
GATATCTTTC CACATTCACT GGTGGCTTTG ACACCTAGTT TTTAATCTCC CATCCTTACT 360 
20 TTAAACCCTG ACAGTGCAGT CCTCAGTCAG GGCCAGGACC GGGCTGAGGC CCTTPGTGGA 420 
GATGCTGCAC CACCAGCAGA AGGCTGAGAC CTGGTTACCT GTACCTGTTC ACTTGTAATA 480 
AAAAGAATTA TCTAAAA 497 

25 



30 



(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
35 (D) TOPOLOGY: linear 



40 



50 



(xi) SEQUENCE DESCRIPTICSJ: SEQ ID NO: 142: 

ATGAGGCAGA GGCAAGCTGC CTGCCAACCC CCTCCCTCAA GGAATGGCCT TGCCCAGGAA 60 

TGCCCACCAC ACATACCCTC TTCriTTTTT CTAGTCAAAC TCTTGTTTAT TCCTTGGCTT 120 

GCCTCCCTCC TTTCCTCCCC TCTCAACCTT TTACTTCTGG TTTCTATTTC ATGGGATTTG 180 

45 GGGTTGAAGT TAAACTTACA ACAGTGCCGC CAACACCAAG TCTTGCAGGA AAAAAATACA 240 

AAGAAATTTA ACAAAAAAAA AAAAAAAAA 269 



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
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TTGATTGACT ATGGTCTCTC CGGCTACCAG GAAGAGTCTC CTGAAGTGAA OGCCATNOSJ^ 60 

TTCATCACCT CCACAGCCAT CXTTGCCCCTG CTGTTCGGCT GCCTGGGCGT CTTCGGCCTC 120 

5 

TTCCGGCTGC TGCAGTGGGT GCGCGGGAAG GCXTTACCTGC GGAATGCTGT QGTGGTGATC 180 

ACAGGCGCCA CCTCAGGGCT GGGCAAAGAA TGTGCAAAAG TCTTCTATGC TGCGGGTOCT 240 

10 AAACTGGTGC TCTGTGGCCG GAATGGTGGG GCCCTAGAAG AGCTCATCAG AGAACTCACC 300 

GCTTCTCATG CCACXAAGGT GCAGACACAC AAGCCTTACT TGGTGACCTT OGACCTCACA 360 

GACTCTGGGG CCATAGTTGC AGCAGCAGCT GAGATCCTGC AGTGCTTTGG CTATGTCGAC 420 

15 

ATACTTGTCA ACAATGCTGG GATCAGCTAC CGTGGTACCA TCATGGACAC CACAGTOGAT 480 

GTOGACAAGA GGGTCATGGA GACAAACTAC TTTGGCCCAG TTGCTCTAAC GAAAGCACTC 540 

20 CTGCCCTTCCA TGATCAAGAG GAGGCAAGGC CACATTGTCG CCATCAGCAG CATCCAGGGC 600 

AAGATGAGCA TTCCTTTTCG ATCAGCATAT GCAGCCTCCA AGCACGCAAC CCAGGCTTTC 660 

TTTGACTGTC TGCGTGCCGA GATGGAACAG TATGAAATTG AGGTGACCGT CATCAGCCCC 720 

25 

GGCTACATCC ACACCAACCT CTCTGTAAAT GCCATCACCG CX3GATGGATC TAGGTATGGA 780 

GTTATGGACA CCACCACAGC CCAGGGCCGA AGCCCTGTGG AGGTGGCXrCA GGATGTTCTT 840 

30 GCTGCTGTGG GGAAGAAGAA GAAAGATGTG ATCCTGGCTG ACTTACTGCC TTCCTTGGCT 900 

GTTTATCTTC GAACTCTGGC TCCTGGGCTC TTCTTCAGCC TCATGCCTCC AGGGCCAGAA 960 

AAGAGCGGAA ATCCAAGAAC TCCTAGTACT CTGACCAGCC AGGGCCAGGG CAGAGAAGCA 1020 

35 

GCACTCTTAG GCTTGCTTAC TCTACAAGGG ACAGTTGCAT TTGTTGAGAC TTTAATGGAG 1080 

ATTTGTCTCA CAAGTGGGAA AGACTGAAGA AACACATCTC GTGCAGATCT GCTGGCAGAG 1140 

40 GACAATCAAA AACGACAACA AGCTTCTTCC CAGGGTGAGG GGAAACACTT AAGGAATAAA 1200 

TATGGAGCTG GGGTTTAACA CTAAAAACTA GAAATAAACA TCTCAAACAG TAAAAAAAAA 1260 

AAAAAAAAC 1269 

45 



<2) INFORMATION FOR SEQ ID NO: 144: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 <D) TOPCWjOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
AAAAGGCAAA CTATAGGATA ACACAGAGCC CTTTTTGAAA ATAAATTGGC ATTGGAGTGT 60 
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368 

TTTACCCTCT AGCTGTTTTA CTTAGAATGT AACATATGCT GCCTACCCAC CTCAAAATGT 120 

CTGTACTGCA AGAGGGCXXTT GGGCXTTCTGC TTTCCATATT CACGTTTGGC CAGAGTTGTA 180 

5 GTCCXIAAAGA AGAGCATGGG TGGCAGATGG TAGGGAATTG AACTQGCCTG TGCAATGGGC 240 

ATGGAGCACA AGGGGTCACA GCATGCCTCC TGCCTTACCG TGGCAGTACG GAGACAGTCC 300 

AGAACATGGT CTTCTTGCCA CGGGGTCTTG TTGTCTCTGG TGGTGCTGCA TGTCTGTOGC 360 

10 

TCACCTTTAT TCTTGAAACT GAGGTTTACC TGGATCTGGC TACTGAC3GCT AGAGCCCACA 420 

GCAGAATGGG GTTGGGCCTG TGGCCCCCAA ACTAGGGGGT GTGGGTTCAT CAGAGTGTTG 480 

15 CCTTTTGTCT CCTAAAGATA GGGATCTACT TTTGAAGGGA ATTGTTCCTC CXAAATAAAT 540 

TTGCTTTACC TTGGTCCTTT CTTTTGTGCC AGTATTCAAG TGGTATAGCTT CTGAGCAGGG 600 
TCACATTTGG CCAAACCTGA CACTGTCTTG CTGCATTCTC CTTTGGCAAA CATCAGGGTC 660 

20 

AGAATTCAGG ATAGCCCTTC CTAGQGCACT GGACTTTCTG GCATGGGGGC TGTGTTTGCA 720 
CAAGTTATTT TCATGTTACC TGGAGAGTCT CCAGAGGCTG CTCTGAGGCT GAGGTGTGTT 780 
25 ccccx:rrTGCC TGGTTCCAGC TGTCAGAGGG ATACCATCCT AGGGTCTGGG AATCCAAGGC 840 
CACGAGACTC CTTGGTTTGT GGTCCGAGAT CXrTGTACTAA GGAGGGTCTG GCXy\GAGGAA 900 

A. 

CAGACCAGCT TTTGCACAAT GAAGCGCAAG GGAACAAGTG GTTTGCCTGG TGTCCTACCT 960 

30 

GTCCTGAACC TGGTCCTGTG GGCCATTGAA AAGTTAGATC TGTGATCTCT GGGGTTTTTG 1020 

TGGCTTTGTT CAATGCTTCC ACTCTAGGGC AGGCAGAGCA GTCTATACTC TCCGAAGCCT 1080 

35 GCTTGACCTC CAAGTAGAGC TGATACAGAG ATCTGTGAAT ATTGTGATAG AAATTCTTTG 1140 

~ ~GTAOT^^MAC~^?OTTCAGCTG~CAfi^^ 1200 

AGTCATTCTT AAAGACAGAG GATAGCTGTG ACTCATGGGA TCATGAGGTC CATQGCTQGT 1260 

40 

TGCAGGTTCC CTTTTTCCTT CCTCAQGTTT TGTCTCTTCC TGTGTTGTCC CCAGCAAGGG 1320 

AGAGACTGTG GGGTGGATTG GGAG7ACAGA TTAQGAGTAT AGCAAATGAA CCCAGAATGG 1380 

45 AACAGTGGGG AGCTAACTGT GAATGAGGAG AGTACCTGCT GCAGGACCTG GAGGTCAGGT 1440 

GTGAATGCTG TATTGGCACA GGGAATAAAT ATCCTGGCGT CTGGAGCXnT CACCTCTCCG 1500 

TCAAGTCCTT CCTGTGATAC TGCCATGGCA CAGGATCTGA GTTGCAGCTC TGCACCCTAA 1560 

50 

ATCACACCCT GGGCATTGTC TGGGCTGCAG GGCTGCCAGG TTCTGTACTT GTGTCCAGCT 1620 

GTGGCCCTGG ATGCTGGAGC TGGAGGGTTT TCTGTGCTCA GACTGTAGCC TGTAGCTCTT 1680 

55 GQCCTGTGTA GAGCCCCCTC CTGTGCCCTC AGTGGCTGTC GTTTGTTAAC ATCATCAGGA 1740 
AGATGGGAAA GGTCAGGCAG AATTTTTCTG CCCTACAAAG GGTGGAAGAG AAAGGACACA 1800 
GTATTTTCAT GAATTTACCA TATATCTTTG TTTTTCTTCA ACGAAAAAGT TAATTGAGGC 1860 

60 
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AATGTCATCT GCTCAAAGTT GAGTGGTTTA TTCACAATAA ACTGTAAGTT TCTGATTATA 1920 
AAAAAAAAAA AAAAAAAAAA AAAG 1944 

5 

(2) INFORMATION FOR SEQ ID NO: 145: 

10 (i) sbqueik:e characteristics: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOUDGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
TCGACCCACG CGTCCGGGGT GCGCAACGGG GAGTTCCGGC TGGAGACCCG TGCTCTGGGC 60 
20 CGGCGCCTTC ACCATGGCCT CGGCAGAGCT GGACTACACC ATCGAGATCC CGGATCAGCC 120 
CTGCTGGAGC CAGAAGAACA GCCCCAGCCC AGGTGGGAAG GAGGCAGAAA CTCGGCAGCC 180 
TGTGGTGATT CTYTTGGGCT GGGGTGGCTG CAAGGACAAG AACCTTGCCA AGTACAGTGC 240 

25 

CATCTACCAC AAAAGGGGCT GCATCGTAAT CCGATACACA GCCCCGTGGC ACATGGTCTT 300 
CTTCTCCGAG TCACTGGGTA TCCCTTCACT TCGTGTTTTG GCCCAGAAGC TGCTCGAGCT 360 
30 GCTCTTTGAT TATGAGATTG AGAAGGAGCC CCTGCTCTTC CATGTCTTCA GCAACGGTGG 420 
CGTCATGCTG TACCGCTACG TGCTGGAGCT CCTGCAGACC CGTCGCTTCT GCCGCCTGCG 480 
TGTGGTGGGC ACCATCTTTG ACAGCGCTCC TGGTGACAGC AACCTGGTAG GGGCTCTGCG 540 
GGCCCTGGCA GCCATCCTGG AGCGCCGGGC CGCCATGCTG CGCCTGTTGC TGCTGGTGGC 600 



35 



55 



CTTTGCCCTG GTGGTCGTCC TGTTCCACGT CCTGCTTGCT CCCATCACAG CCNTCTTCCA 660 

40 CACCCACTTC TATGACAGGC TACAGGACGC GGGCTCTCGC TGGCCCGAGC TCTACCTCTA 720 

CTCGAGGGCT GACGAAGTAG TCCTGGCCAG AG7VCATAGAA CGCATGGTGG AGGCACGCCT 780 

GGCACGCCGG GTCCTGGCGC GTTCTGTGGA TTTCGTGTCA TCTGCACACG TCAGCCACCT 840 

45 

CCGTGACTAC CCTACTTACT ACACAAGCCT CTGTGTCGAC TTCATGCGCA ACTGCGTCCG 900 

CTGCTGAGGC CATTGCTCCA TCTCACCTCT GCTCCAGAAA TAAATGCCTG ACACCTCCCC 960 

50 ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG GGGCCCGGTA CCCAATTCGC CCTATAAAGG 1020 

T 1021 



(2) INFORMATION FOR SEQ ID NO: 146: 



<i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1285 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

GGCACGAGGA GGGCCACGGC AGCCATCGCG CTTTGCAGTT CGGTCTCCTG GTGTACGGOC 60 

AACGCCAAGT AGGGGATTGC GTTCCCTCCA GTCGCAGACC CTATCAGATT TGGATATGTC 120 

CTTCATATTT GATTGGATTT ACAGTGGTTT CAGCAGTGTG CTACAGTTTT TAGGATTATA 180 

TAAGAAAACT GGTAAACTGG TATTTCTTGG ATTGGATAAT GCAGGAAAAA CAACATTGCT 240 

15 ACACATGCTA AAAGATGACA GACTTGGACA ACATGTCCCA ACATTACATC CCACTTCCGA 300 

AGAACTGACC ATTGCTGGCA TGACGTTTAC AACTTTTGAT CTGGGTGGAC ATGTTCAAGC 360 

TCGAAGAGTG TGGAAAAACT ACCTTCCTGC TATCAATGGC ATTGTATTTC TGGTGGATTG 420 

20 

TGCAGACCAC GAAAGGCTGT TAGAGTCAAA AGAAGAACTT GATTCACTAA TGACAGATGA 480 

AACCATTGCT AATGTGCCTA TACTGATTCT TQGGAATAAG ATCGACAGAC CTGAAGCCAT 540 

25 CAGTGAAGAG AGGTTGCGAG AGATGTTTGG TTTATATGGT CAGACAACAG GAAAGGQGAG 600 

TATATCTCTG AAAGAACTGA ATGCCCGACC CTTAGAAGTT TTCATGTGTA GTGTGCTCAA 660 

AAGACAAGGT TACGGAGAAG GCTTCCGCTG GATGGCACAG TACATTGATT AACACAAACT 720 

30 

CACATTGGTT CCAGGTCTCA ACGTTCAGGC TTACTCAGAG ATTTGATTGC TCAACATGCA 780 

TAACTTGAAT TCAATAGACT TTTGCTGGTT ATAAAACAGA TGTTTTTTAG ATTATTAATA 840 

35 TTAAATCAAC TTAATTTGAA TGAGAATTGA AAACTGAITC AAGTAAGTTT GAGTATCACA 900 



40 



■ MCTTAGCTO~TCrAATTC TAAAW3TACT TOGTTTTTAC AGTTTATAAT CTGACATCAC 960 

CCCAGCGCCA TTTGTAAAGA GCAACTTTCC AGCAGTACAT TTGAAGCACT TTTTAACAAC 1020 

ATGAAACTAT AAACCATATT TAAAAGCTCA TCATGTTAAA TTTTTTATGT ACTTTTCTGG 1080 

AACTAGTTTT TAAATTTTAG ATTATATGTC CACCTATCKT AAGTGTACAG TTAATAATTA 1140 

45 GCTTATTCAA TGATTGCATG ATGCCTTACA GmTCAATA ACTTTTTTTC TTATGCAAAC 1200 

GTCATGCAAT AAAACAAACT CTAATGTTTG GCAAAAAAAA AAAAAAAAAA NTCGAGGGGG 1260 

GGCCCGTACC CAATTCGCCC TAAAG 1285 

50 



55 



(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 
(C> STOANDEDNESS: double 

60 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGCACGAGGT GGCGCAGGGG TCAGTGGTTC TCTCGGGTCT CGGGACAGGT GAGCACCCTG 60 

5 

ATGAAGGCCA CGGTCCTGAT GCGGCACCTG GGCGGGTGCA GGAGATCCTG GGCGCCCTCC 120 

GCAAGGGCGS CGGAGACCGG TTACAGGTGA TTTCTGATTT TRACATGACC TTGAGCAGGT 180 

10 TTGCATATAA TQGAAAGCGA TGCCCTTCTT CTTACAATAT TCTQGATAAT AGCAAGATCA 240 

TCAJC5TGAGGA GTGTCGGAAA GAGCTCACAG CGCTCCTTCA CCACTATTAC CCAATTGAGA 300 

TCGACCCACA CCGGACCGTC AAGGAGAAGC TACCTCATAT GGTGGAATGG TGGACCAAAG 360 

15 

CGCACAATCT CCTATGTCAG CAGAAGATTC AGAAGTTTCA GATAGCCCAG GTGGTTAGAG 420 

AGTCCAATGC AATGCTCAt3G GACXX3ATATA AGACCTTCTT CAACACACTC TACCATAACA 480 

20 ACATTCCCCT TTTCATCTTT TCTGCGGGCA TTGGTGATAT CCTGGAAGAA ATTATCCGAC 540 

AGATGAAAGT GTTCCACCCC AACATCCACA TCGTGTCTAA CTACATQGAT TTTAATGAAG 600 

ATGGTTTTCT CCAGGGATTT AAGGGCCAGC TGATACACAC ATACAACAAG AACAGCTCTG 660 

25 

TGTGTGAGAA CTSTGGTTAC TTCCAGCAAC TTGAGGGCAA AACCAATGTC ATCCTGCTGG 720 

GAGACTCTAT CGGGGACCTC ACCATGGCCG ATGGGGTTCC TGGTGTGCAG AACATTCTCA 780 

30 AAATTGGCTT CCTGAATGAC AAGGTGGAGG AGCOGCGQGA NCGCTACATG GACTCCTATG 840 

ACATCGTGCT GGAGAAGGAC GAGACTCTGG ATGTCGTCAA CGGGCTACTG CAC5CACATCC 900 

TGTGCCAGGG GGTCCAGCTG GAGATGCAAG GCCCCTGAAG GCGCAGGCTN CCAGNCCGCC 960 

35 

TGCAGGCCGT GGTX3AGGAGG GGCGCCTCCC CAGAGTCTGC TCCCCCGTGA ACACAGAGCA 1020 

GANGCCAGGG TGGCCAGCAG TGGCTGGGTC CTTCCGCGCC CCTCCGTCCT CCTTTCCCTG 1080 

40 AGCACCTTCA TCACCAGAGG CTTGAAGGAA CCCCGCCATG TGGCAGGGCA CAGGCACTGT. 1140 

TCCTGGTGAA CCTTGGACCA CAGCATGTCA GTGCTCTAGG GATTGTCTAC TCCAGGGATT 1200 

TTCTTCAAAA TTTTTAAACA TGGGAAGTTC AAACAAATAT AATGTGTGAA ACAGATCAAA 1260 

45 

ATTTTTAAAA TGAAAAAAAA GCTGCTCTGA TTCAGGGGAT GTGGGTCGGG GTAGAACCTG 1320 

GACCTCTTGG CCTGGGGGCA CATGGGATGC TTCTAGGAAC ACAGTTTGAG AACCACCAAA 1380 

50 AAAAAA 1386 



55 (2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE2«3TH: 2098 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 



tSDOaO: <WO_9839448A^_L> 



wo 98/39448 



372 



PCT/US98/04493 



10 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AGCCCTTCTC CCCGCGCTTG GGACTCTGAC ATCTTAAGGC TGCACGGTCG TC3TCCTTGTC 60 

TGGGTGAGGC CATGTCTGTG ATCCAAGGTT CCTGGAACTG ACACAGGAAG GGGCTGTGAA 120 

CCCTAAGTGG GTC3TMATCTC CTCCRACCGA GGCTTCTOAC CCTGGAGATG GCAGTTACTC 180 

CTGGCCATGG TTGCTGAGCA TGGGCAGACC AGTGGAGGCC ACCCTACTGT GTTATCTGCG 240 

CCTTCRATGA AGTGAGACCC TTGGGGAGAA CGGGCTGTGG ATGAAGGAC5T GGACTGCAGC 300 

15 CTTGGCCTAG CCACTGGGCT GGGATCTTCT GGGTCATGTG ACTGTGTATC CAGGAGCAGA* 360 

AACTTGTATT CTCAGGATTC AGGATCTACC CAGCACCAAA GATGTATTTT CAGGAGAACA 420 

GACCTAGAAA TGGGCCTGTC TGGCATTTCA GAGTCAGGCA AAGCAGGCAG GGCCAGQGAG 480 

20 

CTTCTGTGGG TCTACACAAG AAGGTTCCTG TGAGGGCTAT CAGTTGTTGC CTTCTAGdT 540 

GCTGGrrAACT TTQGCGCCTC CGCCAAGCCC TGCCAGACTC CCCTGGCTGT GATQGCATTC 600 

25 TCTGCCATCC TGCCTTGTCC CCAGCCTCTG CAGGATGCCC TCCCTACCCA MCTYTYCCTG 660 

GGCCTTCCCT GTCCACTGGG CTGGATTCAT GTTCAAACCA CTGGACTGGC AGGGCAACGA 720 

CTTCITCCCA CCTCAAGATG AGGTCCTCGC CCCCTTGTCT TGGCATAAAA ACACCTTTAA 780 

30 

AGCATGAGCC ATGTGCTTCT TTGCCdTCT CTGTCCTGTT CCAATCTTCT GCCTCCCAGT 840 

CACTCCCTGG GGACTATQGG ATCACTGTCC CCCCACCTGT GTGGCCACAC CATGTGTCCT 900 

35 GTCAATCCAG AACTGCCTCT GAGCTCCAGG CTGACCACAG ATCAGCCACA GCCTGATGCC 960 

TGCAGCCCCA~CTrTGCTCAr~CCTTCCCCTC~CCCTCCTCCT "TCCTTCCACA~CAGCSAGCCT lOZO" 

ACCTTTYTCC ATCCATGCTC ACCATAGCCC CCTTCCTTGT GACCTGGACC CTCCATTGTA 1080 

40 

CCTGGCTGAG ACTGTCAGCC TCCTGGAGGA GTGGGGTCCA CCTTCTTCTT GCCCTATGCA 1140 

GTCCAAGCTT CACTTCTCAC CCAGCAAGGT TGACTCATCT GCCTCCATGT CTCTGGGGCT 1200 

45 TTGCTGITGC CCTGAAACCT AGCTGGGCTG GTCTTGCTCC CAGCTTGCTT CCCCCTCCTC 1260 

GGATGTCCCT TTGCAGGCCC CTGTCGTTCC TCCGGCACCA GTGTCCTTGG CTGCCATGGC 1320 

AAGCTCATCA GGGGCTTGTA CCCTGGTCAC CAAGCATGGT AGCAGCTGCC TGCATTGTAT 1380 

50 

CTCCATCTGG TCACTGCAGG TGCCAACCCT TCATCCCCCA TGTTTTCCTG GGCCATGGAG 1440 

GQCTGACCTC CGTTrCTGGG GAATGTGGCT GAGCTGTGGT AACCAGCTAC ACCCCAGGTG 1500 

55 CTCTITCCAT GGTGGTGCCT GCTCATCTTG CTGATGCAAA CTAGGAAGTT AGGCTGCATC 1560 

TCGGAGTCGC TTTCGCTGGA GAGGTGCTTT GCTGTCTCTC AGACTCAGTC ACTGTGTTCC 1620 

CTCCCCGCCT CTCTTATCTC CATGGCTGTT TGCAGCTCTC CCAGGTACTT TGGGGTCTGA 1680 
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15 



GCTGGAATTC CTTTGTGGTT TGCTCTTCTG CTTCTCACTC TTGTATTAAG AAGGATTCCA 1740 

CAAAGGGAGA GTOGCATCCC TGCTGCTGCT GTGCCAGACC AGAGTTTCCT GAGGGGCCCT 1800 

GACCCTAACr CTCCAGCTCA GCCCTGTACA CCTGACCCTG TTU^ATGAGTG GGGTTTGCTG 1860 

ACTGTAATCC CTGACACCAG TAAAACCAAA AGGACTCTTG GGGGCTCAGT GTGAGAGCCA 1920 

GGGTTACCTA CTCTGCXT^ TGAQGACAAA CTGCTAGGCT GTATCCCATA ATTTCAGGAT 1980 

GAGAAACATT AACAATAAAA ATTTGTAGTA AACATAACCT CATGANGACT AAAAAAAAAA 2040 

AAAAACTYGG GGGQGGGCCC GTAACCCATT GGGCCCTTNG GGGGGC»I3TT TTAAAATT 2098 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENR3TH: 1847 bcise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 149: 
TCGACCCACG CGTCCGAACT GAGGCGGCGG CGGGAGCCGG TTGGKGTCTG GTCTTCGCGT 
CGGCCCCGCG GACCAGACGC TGCCCCCGGC GCGGGGAGAA GATGGTGCCK AGCGGCCTCG 
GGCCCGCCAC GCGCCGCCAC GAGTGAGCCC AGCGCGACCG CGGGCGTCCG CCGAGCAGCT 
GGCCCGGCTG GGCCCGGGGC GCGCANTGCC CGCCGGGGCG GGGTGGAGCT GATCAGAATA 
ATGTTCAGCA TCAACCCCCT GGAGAACCTG AAGGTGTACA TCAGCAGTCG GCCTCCCCTG 



GTGGTCTTCA TGATCAGCGT AANGCCCATG GCCATAGCTT TCCTGACCCT GGGCTACTTC 
TTCAAAATCA AGGAGATTAA ATCCCCAGAA ATGGCAGAGG ATTGGAATAC TTTTCTGCTA 
CGGTTCAATG ATTTGGACTT GTGTGTATCA GAGAATGAAA CCCTCAAGCA TCTCACAAAC 
GACACCACAA CTCCGGAAAG TACAATGACC AGCGGGCAGG CCCGAGCTTC CACCCAGTCC 
CCCCAGGCCC TGGAGGACTC GGGCCCGGTG AATATCTCAG TCTCAATCAC CCTAACCCTG 
GACCCACTGA AACCCTTCGG AGGGTATTCC CGCAACGTCA CCCATCTGTA CTCAACCATC 
TTAGGGCATC AGATTGGACT TTCAGGCAGG GAAGCCCACG AGGAGATAAA CATCACCTTC 
ACCCTGCCTA CAGCGTGGAG CTCAGATGAC TGCGCCCTCC ACGGTCACTG TGAGCAGGTG 
GTATTCACAG CCTGCATGAC CCTCACQGCC AGCCCTQGGG TGTTCCCCGT CACTGTACAG 
CCACCGCACT GTGTTCCTGA CACGTACAGC AACGCCACGC TCTGGTACAA GATCTTCACA 
ACTGCCAGAG ATGCCAACAC AAAATACGCC CAAGATTACA ATCCTTTCTG GTGTTATAAG 
GGGGCCATTG GAAAAGTCTA TCATGCTTTA AATCCCAAGC TTACAGTGAT TGTTCCAGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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20 



25 



30 



35 



40 



45 



50 
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GATGACCGTT CATTAATAAA TTTGCATCTC ATGCACACCA GTTACTTCCT CTTTCTGATG 
GTCATAACAA TGTTTTGCTA TGCTC3TTATC AAGGGCAGAC CTAGCAAATT GCX5TCAGAGC 
AATCCTGAAT TTTGTCCCX5A GAAGGTGGCT 1TGGCTGAAG CXTTAATTCCA CAGCTCCTTG 
TTriTTGAGA GAGACTGAGA GAACCATAAT CCTTGCCTGC TGAACCCAGC CTQGGCCTGG 
ATGCTCTGTG AATACATTAT CTTGCGATGT TGGGTTATTC CAGCCAAAGA CATITCAAGT 
GCCTGTAACT GATTTGTACA TATTTATAAA AATCTATTCA GAAATTGGTC CAATAATGCA 
CXrrGCTTTGC CCTOGGTACA GCCAGAGCCC TTCAACCCCA CCTTGGACTT GAGGACCTAC 
CTGATGGGAC GTTTCCACGT GTCTCTAGAG AAGGATTCCT GGATCTAGCT GGTCACGACX3 
ATGTTTTCAC CAAGGTCACA GGAGCATTGC GTCGCTGATG GGGTTGAAGT TTGGTTTGGT 
TCTTGTTTCA GCCCAATATG TAGAGAACAT TTGAAACAGT CTGCACCTTT GATACGGTAT 
TGCATTTCCA AAGCCACCAA TCCATTTTGT GGATTTTATG TGTCTGTGGC TTAATAATCA 
TAGTAACAAC AATAATACCT TTTTCTCCAT TTTGCTTGCA GGAAACATAC CTTAAGTTTT 
TTTTGTTTTG TTTTTGTTTT TTTGTTTTTT GTTTTCCTTT ATGAAGAAAA AATAAAATAG 
TCACATTTTA ATACTACCAA AAAATGGACA AAAAAAGTCG AGGGGGG 

(2) INFORMATION FOR SEQ ID NO: 150: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 
(^a)— TYPE^Tiucleic"ac±d 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1847 



(xi) SEQUENCE DESCRIPTIOa: SEQ ID NO: 150: 

GACGCTGACG AGAGAAGGCC TCTTCCTTGA GGGTTGGTGC TGTGTTGCAG TGACCGTGGC 60 

GGATTACGCC AACTCGGATC CGGCGGTCGT GAGGTCTGGA CGAGTCAAGA AAGCCGTAGC 120 

CAACGCTGTT CAGCAGGAAG TAAAATCTCT TTGTGGCTTG GAAGCCTCTC AGGTTCCTGC 180 

AGAGGAAGCT CTTTCTGGGG CTGGTGAGCC CTGTGACATC ATCGACAGCA GTGATGAGAT 240 

GGATGCCCAG GAGGAAAGCA TCCATGAGAG AACTGTCTCC AGAAAAAAGA AAAGCAAGAG 300 

ACACAAAGAA GAACTGGACG GGGCTGGAGG AGAAGAGTAT CCCATGGATA TTTGGCTATT 360 

GCTGGCCTCC TATATCCGTC CTGAGGACAT TGTGAATTTT TCCCTGATTT GTAAGAATGC 420 

CTGGACTGTC ACTTGCACTG CTGCCTTTTG GACCAGGTTG TACCGAAGCA CTACACGCTG 480 

GATGCTTCCC TGCCTTTGCG TCTGCGACCA GAGTCAATGG AGAAGCTGCG CTGTCTCCGG 540 
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GCTTGTGTGA TCCGATCTCT GTACCATATG TATGAGCCAT OTGCTGCTCG AATCTCCAAG 600 

AATCCAGCCA TTCCAGAAAG CACCCCCAGC ACATTAAAGA ATTCCAAATG CTTACTTTTC 660 

TGGTGCAGAA AGATTGTTGG GAACAGACAG GAACCAATGT GGGAATTCAA CTTCAAGrTC 720 

AAAAAACAGT CCCCTAGGTT AAAGAGCAAG TGTACAGGAG GATTGCAGCC TCCCGTTCAG 780 

TACGAAGATG TTCATACCAA TCCAGACCAG GACTGCTGCC TACTGCAGGT CACCACCXTTC 840 

AATTTCATCT TTATTCCGAT TGTCATGGGA ATGATATTTA CTCTGTTTAC TATCAATGTG 900 

AGCACGGACA TGCQGCATCA TCGAGTGAGA CTGGTGTTCC AAGATTCCCC TGTCCATGGT 960 
GGTCGGAAAC TGCX3CAGTGA ACAGGGTGTG CAAGTCATCC TGGACCCAGT GCACAGCGTT ' 1020 

CGGCTCTTTG ACTGGTGGCA TCCTCAGTAC CCATTCTCCC TGAGAGCGTA GTTACTGCTT 1080 

CCX:ATCCCTT GGGGGCAGCC TOGACSTGTAG TCCATTAGTA ATCAGATTCC AGTTTGGACA 1140 

GGGTGGCTGG ATTGTATATC TCGTTAGTAA TX5TACATGCT CTTCAGGTTC TAGGGCTCCT 1200 

GTTAGGGGAG GGAGAAATGT TGAATCAAGA GGGAAAACAA CTACTATGAT TTATAAACAT 1260 

ATTTTAATGT AAAAATTTGC ATTTAAAAGG AGTGGCCCTG TTTTCTGTGT TAAAACCCCA 1320 

TTTGGTGCTA TTGAGTTTGT TCTTTATTCT TTTATCCCAG TGAAAATTGT TGATCTTGCT 1380 

GTAGGGAAAA ATTAAACTCT TTGAATCTCC AAACAAGGAA GTTTCAGCAT TCCCTTATOG 1440 

ATCAGAGGAA CCTTAGAGGC CTGAAATTGT TGCTTCCAGT TTAGCTGCCX: CTCAAATTCA 1500 

AGTGAATATT TTCCCTTCTC CXriTTACCCT TCTCCAGAAA TAAAGCAGGT GACAGGGTTT 1560 

CAGAATCTT 1569 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LiENGTH: 1540 beise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGV: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CCCACGCGTC CGGAAGGATT GACCAGTTAA CCAACATCTT AGCCCCCATG GCTGTTGGCC 60 

AGATTATGAC ATTTGGCTCC CCAGTCATCG GCTGTGGCTT TATTTCGGGA TGGAACTTGG 120 

TATCCATGTG CGTGGAGTAC GTCCTGCTCT GGAAGGTTTA CCAGAAAACC CCAGCTCTAG 180 

CTGTGAAAGC TGGTCTTAAA GAAGAGGAAA CTGAATTGAA ACAGCTGAAT TTACACAAAG 240 

ATACTGAGCC AAAACCCCTG GAGGGAACTC ATCTAATGGG TGTGAAAGAC TCTAACATCC 300 

ATGAGCTTGA ACATGAGCAA GAGCCTACTT GTGCCTCCCA GATGGCTGAG CCCTTCCGTA 360 
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CCTTCCXSAGA TGGATGGGTC TCCTACTACA ACCAGCCTGT GrrTCTGGCT GGCATQGGTC 420 

TTGCTTTCCT TTATATGACT GTCCTGGGCT TTGACTGCAT CACCACAGGG TACGCCTACA 480 

5 

CTCAGGGACT GAGTGGGTTC CATCCTCACT ATTTTGATGG GAGCATCAGC TATAACTGGA 540 

ATAATQGGAA CTGTAGCTTT TACTTGGCTA CGTCGAAAAT GTOGTTTCGT TCGGCAGGTC 600 

10 TGATCTCAGG ATTQGCACAG CTTTCCTGTT TGATCTTGTG TGTGATCTCT GTATTCATGC 660 

CTOGAAGCCC CCTGGACTTG TOCGTTTCTC CTTTTGAAGA TATCCGATCA AGGTTCATTC 720 

AAGGAGAGTC AATTACACCT ACCAAGATAC CTGAAATTAC AACTGAAATA TACATGTCTA 780 

15 

ATGGGTCTAA TTCTGCTAAT ArTGTCCCGG AGACAAGTCC TGAATCTC3TG CCCATAATCT 840 

CTGTCAGTCT GCTGTTTGCA GGCGTCATTG CTGCTAGAAT CGGTCTTTGG TCCTTTGATT 900 

20 TAACTGTGAC ACAGTTGCTG CAAGAAAATG TAATTGAATC TGAAAGAGGC ATTATAAATG 960 

GTGTACAGAA CTCCATGAAC TATCTTCTTG ATCTTCTGCA TTTCATCATG GTCATCCTQG 1020 

CTCCAAATCC TGAAGCTTTT GGCTTGCTCG TATTGATTTC AGTCTCCTTT GTGGCAATGG 1080 

25 

GCCACATTAT GTATTTCCX3A TTTGCXTCAAA ATACTCTGGG AAACAAGCTC TTTGCTTOCX5 1140 

GTCCTGATGC AAAAGAAGTT AGGAAGGAAA ATCAAGCAAA TACATCTGTT GTTTGAGACA 1200 

30 GTTTAACTGT TGCTATCCTG TTACTAGATT ATATAGAGCA CATGTGCTTA TTTTGTACTG 1260 

CAGAATTCCA ATAAATQGCT GGGTGTTTTG CTCTOTTTTT ACCACAGCTTG TGCXTTTGAGA 1320 

ACTAAAAGCT GTTTAGGAAA CCTAAGTCAG CAGAAATTAA CTGGATTAAT TTCCCTTATG 1380 

35 

TTGAGGGCCA TGGRA AAAA A AT TGGGAAA A GGAAAA ACTC AGTTTTAAAT A CGGGA GACT 1440^ 

ATAATGGATA ACACTGRATT CCXCTATTTC TCATGAGTAG ATACAATCTT ACGTAAAAGA 1500 

40 GTGGTTAGTC ACGTGAATTC AGTTATCATT TGACAGATTC 1540 



45 (2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1719 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



55 TACTTATGAG GTCAATTGGA AATAAGAACA CCATTTTACT GGGTCTAGGA TTTCAAATAT 60 

TACAGTTGGC ATGGTATGGC TTTGGTTCAG AACCTTGGAT GATGTGGGCT GCTGGGGCAG 120 

TAGCAGCCAT GTCTAGCATC ACCTTTCCTG CTGTCAGTGC ACTTGTTTCA CGAACTGCTG 180 

60 
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ATGCTGATCA ACAGGGTGTC GTTCAAGGAA TGATAACAGG AATTCGAGGA TTATGCAATG 240 

GTCTGGGACC GGCCCTCTAT GGATTCATTT TCTACATATT CCATGTQGAA CTTAAAGAAC 300 

TGCCAATAAC AGGAACAGAC TTQGGAACAA ACACAAGCCC TCAGCACCAC TTTGAACAGA 360 

ATTCCATCAT CCCTGGCCCT CCCTTCCTAT TTGGAGCCTG TTCAGTACTG CTGGCTCTCC 420 

TTGTTGCCTT GTTTATTCCG GAACATACCA ATTTAAGCTT AAGGTCCAGC AGTTGGAGAA 480 

AGCACTGTGG CAGTCACAGC CATCCTCATA ATACACAAGC GCCAGGAGAC5 GCCAAAGAAC 540 

CTTTACTCCA GGACACAAAT GTGTGACGAC TGAAATCAGG AAGATTTTTC TATCAGCACC 600 

15 CAGGTCTTAG TTTTCACCTC TAGTTCTGGA TGTACATTCC ATTTCCATCC ACAGTGTACT - 660 

TTAAGATTGT CTTAAGAAAT GTATCTGCAT GAACTCCGTG GGAACTAAAG GAAGTGGGAA 720 

CTTAGAACCA GACAGTTTTC CAAAGATGTT ACAATTTCTT TTGAAAAACC TnTGTTTAT 780 

20 

TAGCACCAAT TTCTYGCCAC TAAGCTATTT GTTTTATTAT ACATCCTTTA ATTAAAAACT 840 

ATATATGTAA CTTCTTAGAT ATTAGCAAAT GTCTCTGCTA CCATTTCCTT AAGGTGTTGA 900 

25 GCTTTAACTC TATGCTGACT CAGTGAGACA CAGTAGGTAG TATGGTTGTG GACCTATTTG 960 

TTTTAACATT GTAAAATTTT GAGTCAGATT TTAATATTGT AAAATCTTGG GTCAAATAAT 1020 

TCAAAGCCTT AATGCAGATG CACTAAAACA AAGAAATGGT AAATGAATTG TTTGCATTTA 1080 

30 

AAAAAAAAAA CTCTTAAGAA AACTGTACTA AATCTGAATC ATGTTTTGAG CTTGTTTGCA 1140 

GTACTTTTAA ACATTATTCA CTACTGTTTT TGAAGTGAGA AAGTATCAGC CATTTAGCAT 1200 

35 TTAAGTTGGG GTATTTAGAG CCTGTAATCT AAATGCTGGC TCAAATTTAT TCCCCAGCTA 1260 

CTTCTTATAC CACTATTCTT TTAATGTTTG CATAATCATA AGCACCTCAA CACTTGAATA 1320 

CATAATCTAA AAATTATATA GTAAAGCTGG TAGCCTTGAA AATGTCAGTG TGATATCTAT 1380 

40 

TATGTAGATA AATATATATA GTGGCCTTTC AGGACTGTCA CAGTAACACT TTATTTACAG 1440 

AGCTAATGTT TGTCCTAAAT TTTCAGGACC CTAGAGGAC3A GCTTTATACA ATTACCGATG 1500 

45 TGAATTTCTC TAAAGTGTAT A1TTTTGTGT CCAGTTATAT TATTTAAAAA AGTGTTACTT 1560 

TGTAAAAATT GTATATAAAG AACTGTATAG TTTACACTGT TTTCATCTTG TGTGTGGTTA 1620 

TTGCTTAATG CTTTTTAAAC TTGGAACACT CACTATGGTT AAATAAGGTC TTAAAAGAAA 1680 

TGTAAATATT YTGTTAATAA AOTTAAATAT TTTAATGAT 1719 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 153: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 863 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEEttJESS : double 

(D) TOPOUX3Y: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 153: 

5 

GGCACGAGGG AAGCCGGGAC GATGTCCGCA TGACAACCX3A CXTTTGGAGTT TGGAGGTOCT 60 

TGCCTTAGAG CAAGGGAAAC AGCTCTCATT CAAAGGAACT AGAAGCCTCT CCCTCAGTGG 120 

10 TAGGGAGACA GCCAGGAGCG GTTTTCTQGG AACTGTGGGA TGTGCCCTTG GGGGCCCGAG 180 

AAAACAGAAG GAAGATGCTC CAGACCAGTA ACTACAGCCT GGTGCTCTCT CTGCAGTTCC 240 

TGCTGCTGTC CTATGACCTC TTTGTCAATT CCTTCTCAGA ACTGCTCCAA AAGACTCCTG 300 

15 

TCATCCAGCT TGTGCTCTTC ATCATCCAGG ATATTGCAGT CCTCTTCAAC ATCATCATCA 360 

TTTTCCTCAT GTTCTTCAAC ACCTTCGTCT TCCAGGCTGG CCTGGTCAAC CTCCTATTCC 420 

20 ATAAGTTCAA AGGGACCATC ATCCTGACAG CTGTGTACTT TGCCCTCAGC ATCTCCCTTC 480 

ATGTCTQGGT CATGAACTTA CGCTGGAAAA ACTCCAACAG CTTCATATGG ACAGATQGAC 540 

TTCAAATGCT GTTTGTATTC CAGAGACTAG CAGCAGTGTT GTACTGCTAC TTCTATAAAC 600 

25 

GGACAGCCGT AAGACTAGGC GATCCTCACT TCTACCAGGA CTCTTTGTGG CTGCGCAAGG 660 

AGTTCATGCA AGTTCGAAGG TGACCTCTTG TCACACTGAT GGATACTTTT CCTTCCTQGA 720 

30 TAGRAGGCCA CATTTGCTGC TTTGCAGGGG AGAGTTGGGC CCTATGCATC GGGCAAAACA 780 

GGTGGGATTT TCCAAGGGAA GGGTTCAGAA TTAGGCNTGT TGTTTCAGCC ATTTCCAAGG 840 

AAGGGGAAGG GTTTCCCTNC CCT 863 

35 



(2) INFORMATION FOR SEQ ID NO: 154: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
45 <D) TOPOXjOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AACAGCAAAA AAGAATGATT TCTTCTGAAA TTGTGGAACA TGAGGATTCA AGTTTTTATT 60 

50 

TTGTTACTAG GTGCTGGAGG AACATCCCAG TTCACAAAGC CCCCATCTCT TCCTCTGGAG 120 

CCAGAGCCTG CGGTGGAATC TiAGTCCAACT GAAACATCAG AACAAATAAG AGAGAAATAA 180 

55 GAATAGAATG AATGACCCCA AAATARGGTT TTCTTGGGCG AGGATGTGCT GGATTAGGAA 240 

AGGTGACATG ACACAGGCAG AGCAGAGTGG CACCCACCAC AGAATACAGT GTGTGTTATT 300 

ACGAGGAGCC AGCAGTTGAG CCTAAGGTCC TTCTACCTAC CTGGTATTGG CATTTGAGGT 360 
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CX5GAAACCCT CTACTGCCCC ATAAGCCAGG AAAAGTGAAA AGAGAACACA GTTCCTTTAA 420 

GAACTGGCAG CAAGGCTTGA GGCCTTATGT ATCTAGCTGA GTCAGCAAGG TACATGATGC 480 

TGTCTGCTTT CAAAAGGACT TTTCTCTCCT AGCTGACTGA CTCCTTCCTT AGTTCAAGGA 540 

ACAGCTGAGA CAGACCTCTG CTGAGTAGCT CTGTGATGAC AAAGCCTTGG TTTAACTGAG 600 

GTGATCCTCA GGTTGTGAGG TTTATTAGTC CCCAAGGCAA ACACAAATAT TAGATTAATA 660 

ATCCAACTTT AATAGTATAC ATTTAAAAGA AAAAAAACAA AAGCCCTGGA AGNTTGAGGC 720 

CAAGCXTTGCT GAGTATTGCA GCTGCATTTG CCCAAAGGGA ATCCAGAACA AGTCCXTTCCX: 780 
15 TGTATTTTGT TCTTGAGAGG GGTCAGTCTA GAAGCTAGAT CCTATCAGGA TGAGGAGCAG ' 840 

CAGCCCAGGG CTTGTCTGGA TCAGCACCAA CGAITTTAAA GAAAAAAGGA AGAGmCTT 900 

AGATGAGTAA TTGTTATTGA AGATAGTCAG TGATAACCAC TGACCAGATG CTATCAATAC 960 

20 

ACTATGTGTC CTTTTTAGAA TAAAGATTAC ATATCATCAT TCCTTTQGGG AAAATTGTTA 1020 

TTCAGGTATA AAAACAAGAG ATTATAATAA AAAANTAAAA GAACCCTAAA AAAAAAAAAC 1080 

25 CTCGTGCCGA ATTCCCTGCA G 1101 

30 (2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEENESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

40 CAATTAACCC GTTTGAGGCC TAGGTTGTTT GGCAAGCCCC NGGCCTAAAG TTTTAATTCG 60 

GCAGAGCCAA GGGCCTGAAA GGAAGGGAAA GGGGAGGGTA GCGGGAGGGT AGCAGGTGAG 120 

TTCCTAGGGC TGGAAGGTTT AGCAGCAGCC TGGTGCAGTG CCCTGTCATC AAGACAAACC 180 

45 

CACGGTCCTC CTGGGTGCCT ACCAAGCTTG GTTTGTACAA AAGCAAGGTG GGAGTCTATT 240 

TTTGTACATG AGATACATCA CACTTACCTG TGQGCCAGTA TTGTGAAGTG AGTCTGAGTT 300 

50 GTTTACACTG ATGCCTTCCC TGCCCACCAC AAATTGTGTA CATAGTCTTC AGAATGATAC 360 

CACCCCTTTC CCCAGCTCCC AACCAAGAGC TGGTTCTAGG CCTGTGTTAT ATGTCATATT 420 

TAGCGTTTTT ATATATGACC TTTGATTTCT GTTGTTTGTA TTTTAGCACA GTGTATGCAC 480 

55 

CTTCATTTAA ATACATCTGT GTGCATACAG ATAOGCATAT ATGTGTGTGC GTATGCATAT 540 

ATCTCTCATC TGTAGTTTCC AAGAGTTCAG CTGAAGCAGA TGGAGTCCTG CAGCCCAGGA 600 
60 GACACCCTGC ATCCCTGCTA ATAGTGTTTG CCACAAGTAT TAGTGAGTCT TCCTTATTAA . 660 
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TATTTTCATT TCAGAAGACT GAAGCAAAGC TGATAC3TOTT TGCTGTTTCT TTGGCAGCTA 720 

AGTGAGGGTC TTGQGATGAC TTGCTrGTGTT CCTCAAGCTG CACTTTGGGG CCATCTCTGC 780 

5 

AGTATTAAGC CCCCTTTTTG CTTGGTGGTA CTCTGTCTGT GCCTOTGTGT GTGTGTGATA 840 



GTCACTCTTG CATGGCTTCC ATGTCTGGTT TGTGGCATTT GGGGATAAGT GCTGAACCAG 900 



10 AGCATTTGCA GTTTGTTTGA GGGCTCGTTG CXIAATGATAG ATCACTCCTG TTGACCTGGT 960 



ATGTCTGCTT GCTTGCTGCT TTTCCTTGCT TTCTCTTC3GA AGAGGAAAGG ACTCTGGTCA 1020 



GGCCCAOGCT GAGTGAGAOXS AGCTGCAGCT GGCTCATGGC CTTCTTAGAG CAGAGAGAGG 1080 

15 

AGTATGTCAT TTTACTAAGT TCCTAAACAA ACATTTATGC AGGCAACACT CCTTGCAGAT 1140 



CCAGAAACTG AGGCACAATA GGGTTATGAC TTGCTCAAGA ATATGTAGCT GCTAGGGGGT 1200 



20 AAATCAAGGC ATCACAATTT CTGTTCAGCG GGCAGGAATA GGCTGTGAAT TGCTAGCACT 1260 



TTTTTTTTAA GCAATTACTT TTTGACTTGT TCCTCTGAAA GTGCAAGAGG CX3TACACCTT 1320 



TCCCAAATGT AGACTAGAAT CTGCAGGATG CCACCCACTG TATAGTTCTG CTTTCCCAGA 1380 

25 

GAGGAAGAAC TTTTAGAAAC CAAATGATCT TAATTGTTAT TGCCCACCCC TGGCTTTTCC 1440 



GGGTAGAAAA TTCACAGTAG GAATGATTGT TAAGAGAGAG TGCTTGGAAC CATGGGTTAA 1500 



30 CAGGAAAGGC TACCTAACTT CACATATCTG CAACCAGAGC AGCCACCAAG CATTACTTAG 1560 



CAGCAGGAAA ATGATTGTAT TTGAGTTCCT GTGTGTCCAA AACTGAGGCA CCATGTTCTT 1620 



TGAAAACATG CCACCTCAAG GCTQGGCGCG GTGGCTCACA CCTGTTAATC CCAGCACTTT 1680 

35 

GGGAGGCCGA GGCGGGCGGA TCACCGGAGT CGGGGAGTTT GAGACCAGCC TGGACCAACA 1740 



TGGGAGAAAC CCCATCTCTA CCTAAAAATA CAAAATTAGC CGGGCGTGGT GGCATGCGCC 1800 



40 TATAATCTCA GCTACTTGGG AGGGYTGAGG CAGGRGAATT GCTTGAACCC RGGANGGCGG 1860 



AGGTTTGCGG TTGAGTTGAG GATCGTGCCA TTGCACTTCC GGGCCTTGGG GCAACAACAG 1920 



CAAAAAYTCC GTCTTCAAMW tJRTOCCXSAAT TCGATATCAA GCTTATCGAT ACCGTCGACX: 1980 

45 

TCGAGGGGGG GCCCGGTACC CAATTCGCCC TATAGNGATC GTATTACAAT C 2031 



50 



60 



(2) INFORMATION FOR SEQ ID NO: 156: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGra: 1981 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
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25 



30 



35 



40 



45 
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CCTGCACCCT GAGCCCTTCA CCCCTCCGAG TTCCCCCCAG GTTGGCTTCC TTCGATTCCT 60 

TTTCTTQGTA TCAACGTTTG ATTGGAAGAA CAACCCCCTC TTTGTCAACC TCAATAATGA 120 

GCTCACTGTG GAGGAGCAGC TCGGGCACAG CTCMCXX3TYA TGGTCATTGT TACCCCCCAA 180 

GACCGCAAAA ACTCTGTGTG GACACAGGAT GGACCCTCAG CCCAGATCCT GCAGCAGCTT 240 

GTGGTCCTGG CAGCTGAAGC CCTGCCCATG TTAGAGAAGC AGCTCATGGA TQCCCQGGGA 300 

CCTGGGGACA TCAGGACAGT GTTCCGGCCG CCCTTGGACA TTTACGACC3T GCTGATTCGC 360 

CTGTYICCTC GCCATATCCC GCGGCACCGC AGGCTTGTGG ACTCGCCAGY- TCCCTCCOTC 420 

TGCCGGGGCX: TGCTCAGCCA GCXXX3GQCCC TCATCCCTGA TGCCCGTGCT GGGTOATCAT 480 

CCTNCTCAGC TCTATCTGAC GCAGCTCAGG GAGGCCTTTG GGGATCTGGC CCTTTTCTTC 540 

TATGACCAGC ATGGTGGAGA GGTGATTGGT GTCCTCTGGA AGCCCACCAG CTTCCAGCCG 600 

GAGCCCTTCA AGGCCTCCAG CACAAAGGGG CGCATGGTGA TGTCTCGAGG TGGGGAGCTA 660 

GTAATGGTGC CCAATGTTGA AGCAATCCTG GAGGACTTTG CTGTGCTGGG TGAAGGCCTG 720 

GTGCAGACTG TGGAGGCCCG AAGTGAGAGG TGGACTGTGT GATCCCAGCT CTGGAGCAAG 780 

CTGTAGACGG ACAGCAGGAC ATTGGACCTC TAGAGCAAGA TGTCAGTAGG ATGACCTCCA 840 

CCCTCCTTGG ACATGAATCC TCCATGGAGG GCCTGCTGGC TGAACATGCT GAATCATCTC 900 

CAACAAAACC CAGCCCCAAC TTTCTCTCTG ATGCTCCAGC ATTGGQGCAG GQGCATGGTG 960 

GCCCATGTAG TCTCCTGGGC CTCACCATCC CAGAAGAGGA GTGGGAGCCA GCTCAGAGAA 1020 

GGAACTGAAC CCAGGAGATC CATCCACCTA TTAGCCCTGG GCCTGGACCT CCCTGCGATT 1080 

-TCCCACTCCT--TTCriTAGTCT-TGTTCGAGAA-AGAGAGtf^^ - - -1.140- 

CTCTGTCTCC TTCCTGCTGC CAGGACCTGT GCCTAGACTT AGCATGCCCT TCACTGCAGT 1200 

GTCAGGCCTT TAGATGGGAC CCAGCGAAAA TGTGGCCCTT CTGAGTCACA TCACCGACAC 1260 

TGAGCAGTGG AAAGGGGCTA TATGTGTATG AATAGACCAC ATTGAAGGAG CACAATGCCC 1320 

TCCTGTGTTG ATGCCACTTC CCAGGGTGGA GACAGTGGAA AAGAACCGAG GACAGGAAAG 1380 

GATTGGGTAG GTGAAGGGGT CAGGGGACTG GTAGTCACCC AATCTTGGAG AGGTGCAAAA 1440 

AGCACTGGGG GCTACCCGTT AGCTGCATCT GCCCTGGCTG TTTGCCCGTr CATGTCACAA 1500 

ACTGCCACTA CTATGTACCT GCAGTGGGGT TGCAGAGATG GGGGAGACTC AAGTCTTACT 1560 

CCCCAGGAGC TCCCAGGGCC CAAGGAGGAG AATGCTGCCT CCTTTCAGTC TGGTCTACAC 1620 

CCACTTTCTG GTAGCCTCTC TGCTTCCTGT AATTCTGGCT GTTTTTCCAG ACTCAGCTCA 1680 

AATAGTGCCC CTCCTTAAGC CCATCCCTCG CCCCCAGCCT GAGGTGATCT TTCCCTCCTC 1740 

TGAACTATTA GAGCAGTTAC TGTCTGTTCA GTTCGTTTGG CAGGCACACA CAGTGGCATA 1800 
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AATTCTATTG TTTTGAACTC TGATTTAAAA TTAAATTGCA GCTGGGCGTG GTGGCTCATG 1860 

dTGTAATCC CAACACTTAG GGAGTMAGGR GAATCACTTG ASCYCAGGAG TYCTAGACCA 1920 

ATCTGGGCAA MAGAGAGACC CCATCTCTTT TAAATAAAAA GTTAAATTGC TTAAAAAAAA 1980 

A 1981 

(2) INFORMATIC»I FOR SEQ ID NO: 157: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
/(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GAATTCGGCA CGAGCGCGGC CATGGCGCTC CTGCTTTCGG TGCTGCGTGT ACTGCTGGGC 60 

GGCTTCTTCG CGCTCGTGGG GTTGGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTCG 120 

25 

GAGCGGATGA ATGCCCTGTT CGTGCAGTTT GCTGAQGTGT TCCCGCTGAA GGTATTTGGC 180 

TACCAGCCAG ATCCCCTGAA CTACCAAATA GCTGTGGGCT TTCTGGAACT GCTGGCTGGG 240 

30 TTGCTGCTGG TCATGGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTGATTCTG 300 

CTCATGATGG GGGCTATCTT CACCTTGGCA GCTCTGAAAG AGTCACTAAG CACCTGTATC 360 

CCAGCCATTG TCTGCCTGGG GTTCCTGCTG CTGCTGAATG TCGGCCAGCT CTTAGCCCAG 420 

ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTGG 480 



AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540 

40 ACAGAGTCTA TCATCTTGTT ACCAGTATAA TATCCAGGGT CAGCCAGTGT TGAAAGAGAC 600 

ATTTTGTCTA CCTGGCACTG CTTTCTCTTT TTAGCTTTAC TACTCTITTG TGAGGAGTAC 660 

ATGTTATGCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAGC AGAAAAGAAA 720 

45 

TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780 

TACCTCTGAA CTTTTTTCTG TGCCTTTAAA CAGATATATA TTTTTTTTWA ATGAAAAT7VA 840 

50 AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 900 

AAAAAAAAAA CTCGA 915 



(2) INFORMATION FOR SEQ ID NO: 158: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 2117 base pairs 
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(B) TYPE: nucleic acid 

(C) STRM^DElijESS: double 

(D) TOPOUXSTY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

AGAGCGAAGC GAGQGTGGCG CGGGTCCGGG CATGAAGCTG GGCCGGGCCG TGCTGGGCCT 60 

GCTGCTGCTG GCGCCGTCCG TQGTGCAGGC GGTGGAGCCC ATCAGCCTGG GACTGGCCCT 120 

GGCCGGCGTC CTCACCGGCT ACATCTACCC GCGTCTCTAC TGCCTCTTCG CCGAGTGCTG 180 

CGGGCAGAAG CGGAGCCTTA GCCGGGAGGC ACTGCAGAAG GATCTGGAOG ACAACCTCTT 240 

15 TGGACAGCAT CTTGCAAAGA AAATCATCTT AAATGCCGTG TTTGGTTTCA TAAACAACCC' 300 

AAAGCCCAAG AAACCTCTCA CGCTCTCCCT GCACGGGTGG ACAGGCACCG GCAAAAATTT 360 

CGTCAGCAAG ATCATCGCAG AGAATATTTA CGAGGGTGGT CTGAACAGTG ACTATGTCCA 420 

20 

CCTGTTTGTG GCCACATTGC AdTTCCACA TGCTTCAAAC ATCACCTTGT ACAAGGATCA 480 

GTTACAGTTG TGGATTCGAG GCAACGTGAG TGCCTGTGCG AGGTCCATCT TCATATTTGA 540 

25 TGAAATGGAT AAGATGCATG CAGGCCTCAT AGATGCCATC AAGCCTTTCC TCGACTATTA 600 

TGACCTGGTG GATGGGGTCT CCTACCAGAA AGCCATGTTC ATATTTCTCA GCAATGCTGG 660 

AGCAGAAAGG ATCACAGATG TGGCTTTGGA TTTCrGGAGG AGTQGAAAGC AGAGGGAAGA 720 

30 

CATCAAGCTC AAAGACATTG AACACGCGTT GTCTGTGTCG GTTTTCAATA ACAAGAACAG 780 

TGGCTTCTGG CACAGCAGCT TAATTGACCG GAACCTCATT GATTATTTTG TTCCCTTCCT 840 

35 CCCCCTGGAA TACAAACACC TAAAAATGTG TATCCGAGTG GAAATGCAGT CCCGAGGCTA 900 

. TGAAATTGAT- XiAAGACATTG- TAAGCAGAGT-GGCTGAGGAG-ATGACATTTP-TCGGGAAAGA SSQ-- 

GGAGAGAGTT TTCTCAGATA AAGGCTGCAA AACGGTGTTC ACCAAGTTAG ATTATTACTA 1020 

40 

CGATGATTGA CAGTCATGAT TGGCAGCCGG AGTCACTGCC TGGAGTTGGA AAAGAAACAA 1080 

CACTCAGTCC TTCCACACTT CCACXTCCCAG CTCCTTTCCC TGGAAGAGGA ATCCAGTGAA 1140 

45 TGTTCCTGTT TGATGTGACA GGAATTCTCC CTGGCATTGT TTCCACCCCC TGGTGCCTGC 1200 

AGGCCACXXIA GGGACCACGG GCGAGGACGT GAAGCCTCCC GAACACGCAC AGAAGGAAGG 1260 

AGCCAGCTCC CAGCCCACTC ATCGCAGGGC TCATGATTTT TTACAAATTA TGTTTTAATT 1320 

50 

CCAAGTGTTT CTGTTTCAAG GAAGGATGAA TAAGTTTTAT TGAAAATGTG GTAACTTTAT 1380 

TTAAAATGAT TTTTAACATT ATGAGAGACT GCTCAGATTC TAAGTTGTTG GCCTTGTGTG 1440 

55 TGTGTTTTTT TTTAAGTTCT CATCATTATT ACATAGACTG TGATGTATCT TTACTGGAAA 1500 

TGAGCCCAAG CACACATGCA TGGCATTTGT TCCACAGGAG GGCATCCCTG GGGATGTGGC 1560 

TGGAGCATGA GCCAGCTCTG TCCCAGGATG GTCCCAGCGG ATGCTGCCAG GGGCAKTGAA 1620 

60 
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GTGTTTAGGT 
AACAGCCTCT 
CACACTGTCA 
TTGCCAGTCA 
CGTGCTCTCT 
TACTATTATT 
CTTCTGAGGA 
ATAAAATTTT 
GTAACCAATT 



GAAGGACAAG 
CXMGGGTTT 
ACATTTGTTA 
CTCOSTTTTG 
GCTGGATTCT 
TGTTACXnTC 
TGCTGAGAAC 
GCATGTGCAA 
•TCNCXXTC 



taggtaagag 
tcaccttagc 
gaacx:agtct 
caaaaggtgg 
aaattatgtg 
aatcagaatc 

GGrrGTCTTTC 

aaaaaaaaaa 



gacgccttca 
aacaatggga 
tttgaaagaa 
ccxrrrcACTG 

AATTTTGCCA 

cccgaaacct 
tttataaatg 
anaaaaaaaa 



GGCACCACAG 
GCTGTCGGAG 
AAGTATTTCC 
TCCATTCCAA 
TATTAAATCr 

cxttataaacsc 
caaatggcta 
aaaatccx:gg 



ATAAGCCTGA 
TGATTTTGGC 
AACTTGTCAC 
ATAGCCCACA 
TCCTCATTTA 
TTAGCTGCCC 
CCGTTTTACA 
GGGGGGGCCG 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2395 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
TGTTCCTTAA TCCCTTTTCT AAAAAGGGGG GAAAATCCGG ATGGATTTTA GQGATTGGTC 
TGGTGTCAGC TGTGTTTTAT TGCACACCTA AATCCTGATT ATAGGCTTTT CATTTCTCCG 
CAAAGCCTTT ATTTTOQCAG TTAAGCCAAA TGTGTTTTCC AGAAAGTTAG TTA I ' l ' lT CTC 
CTCTTTCTTT CCTTTCTTTC CTCCCmTT CCCGTCTGAC CCCAAACGTT ATTGTCCAAA 
CATGACTGGA CAGCAGCTTT TGTTTCTTGA CCCTGTAATA TGACAGTCTG CTAATATTGA 
CAGAAGGrrGC AGmTTGGG TTATAGTCGT GATTTTCGCT AATCAATCAT ATTAGCAGGA 
AATU^AAAKGA CTTGTTTCTG TTGTACTTGA GTCTTAAGAA AAAGTGGCCC ATAGTTTAGT 
GGACAATTTC CAAAGGCTTT AGTACCACCT GTATTTCAAA ATGQGGGACC CAAACTCCCG 
GAAGAAACAA GCTCTGAACA GACTACGTGC TCAGCTTAGA AAGAAAAAAG AATCTCTAGC 
TGACCAGTTT GACTTCAAGA TGTATATTGC CTTTGTATTC AAGGAGAAGA AGAAAAAGTC 
AGCACTTTTT GAAGTGTCTG AGGTTATACC AGTCATGACA AATAATTATG AAGAAAATAT 
CCTGAAAGGT GTGCGAGATT CCAGCTATTC CTTGGAAAGT TCCCTAGAGC TTTTACAGAA 
GGATGTGGTA CAGCTCCATG CTCCTCGATA TCAGTCTATG AGAAGGGATG TAATTGGCTG 
TACTCAGGAG ATGGATTTCA TTCTTTGGCC TCGGAATGAT ATTGAAAAAA TCGTCTGTCT 
CCTGTTTTCT AGGTGGAAAG AATCTGATGA GCCTTTTAGG CCTGTTCAGG CAAATTTGAG 



1680 
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1800 
1860 
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1980 
2040 
2100 
2117 
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TTTCATCATG GTGACTATGA AAAACAGTTT CTGCATCTAC TGAGCCGCAA GGACAAGACT 
GGAATCGTTG TCAACAATCC TMCCACTTCA GTGTTTCTCT TCATTGACAG ACAGCACTTG 
CAGACTCCAA AAAACAAAGC TACAATCTTC AAGTTATGCA GCATCTGCCT CTAOCTGCCA 
CAGGAACAGC TCACCXiywrrG GGGCftGTTGG CACCATAGAG GRTCACCTCC GTCCTTATAT 
GCCAGAGTAG AGTACTGACC AGCAAAATGG AGAAGATCAG AGAATGCAGC AGCAGTTTTT 
TTrCTTGTTT TCTTACCACT TTATTCTTTC AGAGTTTAAA GAAAATQGAC TCATGCACAG 
AACACTATGC ATTTTGAAAC TTGTTCATCC TCGATTTTTT TAAATCATTT TTATCTCAGA 
ACTTAAACAA AAATTAGATG TCGTGCACGG ACTGTGTGAA AGAAGATGCT TTGCATATTT 
GCTGCACTGC ATCAGTATCT TACTAAAAAT GTGAAATGAA AGGACTATTG TACACTGAAA 
TGCTTAAATG TATCTGAAAG CACAAGGTGA TACTCATTTT TATGGTCTTC CX:ATTTGTGC 
TGC5TTTTTGC CTCTTTGACA TCTGTCATCA GTATTTAGAG C3GTGAGAAGT GAATGTAACA 
GC3TATAAATA ACATTTTTAA AAACAATAAC TTTGCTATAA TCACAGTTGT TCCAGAGCAC 
TGTCAGATAC ATTCTAATGA CCAGAACTGG TTTAAAAAAA GAAAATACAA CCATGGGAAA 
GAAATCTTAA ATGAAAAACG CATCTCATTG TAGGCATTTT TCCCTCATAT TTTACTOGGC 
CATGTTTGTT TCCTGGTACT CATGTATTTT TTTTTTCCAG ATCTCTITCC (XIAAGTTGCT 
ATTGTAAGAG TATTCTGCTG CGTGTGGATG CAGTTATACA CATTAAAGCA GATCTGGAGT 
CTGAAGTAGC TATAAAGCAG CTATAAAACA GAAATACATG CATAGCTGCA GAAACCATGA 
TAGGTAGAGG ACmTCTTT TGGTTTTGTT TTGTTTTGTT TTGTTTTGTT TTTGGTTTTA 



CAGAGAAGAG ATTTTTATTA CAAAGAAAAA AATTCCAGTG AATTGTGCAG AAATXSCTGGT 
TTTTACACCA TCCTAAAGAA AAACTTTACA AGGGTGTTTT GGAGTAGAAA AAAGGTTATA 
AAGTTGGAAT CTTAAATrGT AAAATTAACC ATTGAGTGTC AAAGTTCTAA AAGCftGAACT 
CATTTTGTGC AATGAACATA AGGAAAGACT ACTGTATAGG ITrrmTrr TTCTCCTTTT 
AAATGAAGAA AAGCTTTGCT TAAGGGTTGC ATACTTTTAT TGGAGTAAAT CTGAATGATC 
CTACTCCTTT GGAGTAAAAC TAGTGCTTAC CAGTTTCCAA TTGTATTTAG CTTCTGGTTG 
GAATTTGAAA AAAAAAGAAA AAAAGAAAAA GAAAACCTAA ATAAAATAGG TGAAA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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1620 
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55 (2) INFORMATIC»a FOR SEQ ID NO; 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2120 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

5 CCCCQGATAC CGCCTGACGT AGTGCCAATC ACACCTCTCG CGTCTCGGCG CCTCGGAGGC 60 

TAATGAGGAC GCCTGGCGAA ACGCAGTAAC GGATTTCCGG GTGGACCTTC GCTTTAOC5GC 120 

TCGTGAGTTC TTCCGCCCAA CCCAGAGGAA GCGGGAGAGC AGTTTACGAC AGCGCCGGTC 180 

10 

GTGTTTACGG CGGCGCCCGC TGCGCGCGCA TGTTTCCTCT TTTCCTQGTT TCTCAAGAGT 240 

GCTGCTGCTA ACGCGGTCCC CQGCACGCAC CATCTGTTGC CATCCCGGCC GGCCGAGGCA 300 

15 TTGCAGATTT TGGAAGATGG CAAAGTTCAT GACACCCGTG ATCCAGGACA ACCCCTCAGG " 360 

CTGGGGTCCC TGTGCGGTTC CCX5AGCAGTT TCGGGATATG CCCTACCAGC OGTTCAGCAA 420 

AGGAGATCGG CTAGGAAAGG TTGCAGACTG GACAGGAGCC ACATACCAAG ATAAGAGGTA 480 

20 

CACAAATAAG TACTCCTCTC AGTTTGGTGG TGGAAGTCAA TATGCTTATT TCCATGAGGA 540 

GGATGAAAGT AGCTTCCAGC TGGTGGATAC AGCGCGCACA CAGAAGACGG CCTACCAGCG 600 

25 GAATCGAATG AGAITTGCCC AGAGGAACCT CCGCAGAGAC AAAGATCGTC GGAACATGTT 660 

GCAGTTCAAC CTGCAGATCC TGCCTAAGAG TGCCAAACAG AAAGAGAGAG AACGCATTCG 720 

ACTGCAGAAA AAGTTCCAGA AACAATTTGG GGTTAGGCAG AAATQGGATC AGAAATCACA 780 

30 

GAAACCCCGA GACTCTTCAG TTGAAGTTCG TAGTGATTGG GAAGTGAAAG AGGAAATGGA 840 

TTTTCCTCAG TTGATGAAGA TGCGCTACTT GGAAGTATCA GAGCCACAGG ACATTGAGTG 900 

35 TTGTQGGGCC CTAGAATACT ACGACAAAGC CTTTGACCGC ATCACCACGA QGAGTGAGAA 960 

GCCACTGCGG- ASATNCAAGC "GCATCTTCCA XACTGTCACC~ffiCiC:AGRCG~ACCCTGTa^T T02t) ~ 

CCGCAAGCTG GCAAAAACTC AGGGGAATGT GTTTGCCACT GATGCCATCC TGGCCACGCT 1080 

40 

GATGAGCTGT ACCCGCTCAG TGTATTCCTG GGATATTGTC GTCCAGAGAG TTGGGTCCAA 1140 

ACTCTTCTTT GACAAGAGAG ACAACTCTGA CTTTGACCTC CTGACAGTGA GTGAGACTGC 1200 

45 CAATGAGCCC CCTCAAGATG AAGGTAATTC CTTCAATTCA CCCCGCAACC TGGCCATGGA 1260 

GGCAACCTAC ATCAACCACA ATTTCTCCCA GCAGTGCTTG AGAATGGGGA AGGAAAGATA 1320 

CAACTTCCCC AACCCAAACC CGTTTGTGGA GGACGACATG GATAAGAATG AAATCGCCTC 1380 

50 

TGTTGCGTAC CGTTACCGCA GTGGNAAGCT TGGAGATGAT ATTGACCTTA TTGTCCGTTG 1440 

TGAGCACGAT GGCGTCATGA CTGGAGCCAA CGGGGAAGTG TCCITCATCA ACATCAAGAC 1500 

55 ACTCAATGAG TGGGATTCCA GGCACTGTAA TGGCX3TTGAC TGGCGTCAGA AGCTGGACTC 1560 

TCAGCGAGGG GCTGTCATTG CCACGGAGCT GAAGAACAAC AGCTACAAGT TGGCCCGGTG 1620 

GACCTGCTGT GCTTTGCrGG CTGGATCTGA GTACCTCAAG CTTGGTTATG TGTCTCGGTA 1680 

60 
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CX:ACGTGAAA GACTCCTCAC GCCACGTCAT CCTAGGCACC CAGCAGTTCA AGCCTAATCA 1740 

GTTTGCCAGC CAGATCAACC TGAGCGTGGA GAATGCXTOG GGCATTTTAC GCTGCGTCAT 1800 

5 TGACATCTGC ATGAAGCTGG AGGAGGGCAA ATACCTCATC CTCAAGGACC CCAACAAGCA 1860 

GGTCATCCGT GTCTACAGCC TCCCTGATGG CACCTTCAGC TCTGATGAAG ATGAGGAGGA 1920 

AGAGGAGGAG GAAGAAGAGG AAGAAGAAGA GGAAGAAACT TAAACCAGTG ATCTGGAOCT 1980 

10 

GGAGTTTGTC CTTCCACCXSA GACTACGAGG GCCTTTGATG CTTAGTQGAA TGTGTGTCTA 2040 

ACTTGCTCTC TGACATTTAG CAGATGAAAT AAAATATATA TCTGTTTAGT CTTAAAAAAA 2100 

15 AAAAAAAAAA AAAAAAAAAN 2120 



20 (2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

30 GGAAGCTQAA GTCCTTCCAG ACCAGGGACA ACCAGGGCAT TCTCTATGAA GCTGCACCCA 60 

CCTCCACCCT CACCTGTRAC TCAGGACCAC AGAAGCAAAA GTTCTCACTC AAACTGGATG 120 

CCAAGGATGG GCGCTTGTTC AATGAGCAGA ACTTCTTCCA GCGGGCCGCC AAGCCTCTGC 180 

35 

AAGTCAACAA GTGGAAGAAG CTGTACTCGA CCCCACTGCT GGCCATCCCT ACCTGCATGG 240 

GTTTCGGTGT TCACCAGGAC AAATACAGGT TCTTGGTGTT ACCCAGCCTG GGGAGGAGCC 300 

40 TTCAGTCGGC CCTGGATGTC AGCCCAAAGC ATGTGCTGTG CAGAGAGGTC TGTGCTGCAG 360 

GTGGCCTGCC GGCTGCTGGA TGCCCTGGAG TTCCTCCATG AGAATGAGTA TGTTCATGGA 420 

AATGTGACAG CTGAAAATAT CTTTGTGGAT CCAGAGGACC AGAGTCAGGT GACTTTGGCA 480 

45 

GGCTATQGCT TCGCNTTCCG CTATTGCCCA AGTQGCAAAC ACGTGGCCTA OGTGGAAGGC 540 

AGCAGGAGCC CTCACGAGGG GGACCTTGAG TTCATTAGCA TGGACCTGCA CAAGGGATGC 600 

50 GGGCCCTCCC GCCGCRGCGA CCTCCAGAGC CTGGGCTACT GCATGCTGAA GTGGCTCTAC 660 

GGGTTTCTGC CATGGACAAA TTGCCTTCCC AAMAMTGAGG ACATCATGAA GCAAAAACAG 720 

AAGTTTGTTG ATAAGCCGGG GCCCTTCGTG GGACCCTGCG GTCACTGGAT CAGGCCCTCA 780 

55 

GAGACCCTGC AGAAGTACCT GAAGGTGGTG ATGGCCCTCA CGTATGAGGA GAAGCCGCCC 840 

TACGCCATGC TGAGGAACAA CCTAGAAGCT TTGCTGCAGG ATCTGCGTGT GTCTCCATAT 900 

60 
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(2) INFORMATICaa FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGy: linear 



(xi) SEQimiCE DESCRIPTION: SEQ ID NO: 162: 

GGCACX3AGAT GAGQGGCACC CAGTGCTTCT AGGGCAGGCT GGGTGGTGGT CCCCTAGGTA 60 

15 TCAGCCTCTC TTACTGTACT CTCCGGGAAT GTTAACCTTT CTATTTTCAG CCTGTGCCAC 120 

CTGTCTAGGC AAGCTGGCTT CCCCATTGGC CCCTGTGGGT CCACAGCAGC GTGGCTGCCC 180 

CCCAGGGCCA CCGCTTCTTT CTTGATCCTC TTTCCTTAAC AGTGACTTGG GCTTGAGTCT 240 

20 

GGCAAGGAAC CTTGCTTTTA GCTTCACCAC CAAGGAGAGA GGTTGACATG ACCTCCCCGC 300 

CCCCTCACCA AQGCTGGGAA CAGAGGGGAT GTGGTGAGAG CCAGGTTCCT CTGGCCCTCT 360 

25 CCAGGGTGTT TTCCACTAGT CACTACTGTC TTCTCCTTGT AGCTAATCAA TCAATATTCT 420 

TCCCTTGCCT GTGGGCAGTG GAGAGGCTGC TGGGTGTACG CTGCACCTGC CCACTGAGTT 480 

GGGGAAAGAG GATAATCAGT GAGCACTGTT CTGCTCAGAG CTCCTGATCT ACCCCACCCC 540 

30 

CTAGGATCCA GGACTGGGTC AAAGCTGCAT GAAACCAGGC CCTGGCAGCA AACCTGGGAA 600 

TGGCTGGAGG TGGGAGAGAA CCTGAACTTC TCTTTCCCTC TCCCTCCTCC AACATTACTG 660 

35 GAACTCTATC CTGTTAGGAT CTTCTGAGCT TGTTTCCCTG CTGGGTGGGA CAGAGGACAA 720 



AGGAGAAGGG AGGGTCTAGA AGAGGCAGCC CTTCTTTGTC CTCTGGGGTA AATGAGCTTG 780 

ACCTAGAGTA AATGGAGAGA CCAAAAGCCT CTGATITTTA ATTTCCATAA AATGTTAGAA 840 

GTATATATAT ACATATATAT ATTTCTTTAA ATTTTTGAGT CTTTGATATG TCTAAAAATC 900 

CATTCCCTCT GCCCTGAAGC CTGAGTGAGA CACATGAAGA AAACTGTGTT TCAITTAAAG 960 

45 ATGTTAATTA AATGATTGAA ACTTGAAAAA AAAAAAAAAA AAA 1003 



50 (2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 beise pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
60 AAGAAGCGGC ACACGGATGT GCAGTTCTAC ACAGAAGTGG GAGAGATAAC CACGGACTTG 60 
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GGGAAACATC AGCATATGCA TGACCGAGAT GACCTCTATG CTGAGCAGAT GGAACGAGAA 120 

ATGAGGCACA AACTGAAAAC AGCCTTTAAA AATTTCATTG AGAAAGTAGA GGCTCTAACT 180 

5 

AAGGAGGAAC TGGAATTTGA AGTGCCTTTT AGGGACTTGG GAITTAACGG AGCTCCXTTAT 240 

AGGAGTACCT GCCTCCTTCA GCCCACTAGT AGTGCGCTGG TAAATGCTAC GGAATGGCCA 300 

10 CCTTTTGTGG TGACATTGGA TGAGGTAGAG CTGATCGACT TTRAGCGGGT CXAGTTTCAC 360 

CTGAAGAACT TTGATATOGT AATCGTCTAC AAGGACTACA GCAAGAAAGT GACXIATGATC 420 

AACGCCATTC CTGTAGCCTC TCTTGACCCC ATCAAGGAAT GGTTGAATTC CTGCGACCTG 480 

15 

AAATACACAG AAGGAGTACA GTCCCTCAAC TGGACTAAAA TCATGAAGAC CATTGTTGAT 540 

GACCCTGAGG GCTTCTTCGA ACAAGGTGGC TGGTCTTTCC TGGAGCCTGA GGGTGAGGGG 600 

20 AGTCATGCTG AAGAAGGGGA TTCAGAGTCT GAAATTGAAG ATGAGACTTT TAATCCTTCA 660 

GAAGATGACT ATGAAGAGGA AGAGGAGGAC AGTGATGAAG ATTATTCATC AGAAGCAGAA 720 

GAGTCAGACT ATTCTAAGGA GTCATTGGGT AGTGAAGAAG AGAGTGGAAA GGAOTGGGAT 780 

25 

GAACTGGAGG AAGAAGCCCG AAAAGCGGAC CGAGAAAGTC GTTACGAGGA AGAAGAAGAA 840 

CAAAGTCGAA GTATGAGCCG GAAGAGGAAG GCATCTGTGC ACAGTTCGGG CXX3TGGCTCT 900 

30 AACCGTGGTT CCAGACACAG CTCTGCACCC CCCAAGAAAA AGAGGAAGTA ACTTCTGAAC 960 

TTTGGCCCTG AGCTCCATTC TTCCTCCAGC CAACCCCTGA AAATTTTACA TGACATAGAA 1020 

ACTGTATTTT TCCTTTCGTT OTCATTTGAA GTTTTGCXIAT TTGTGTTTAT GGGTTTAGGG 1080 

35 

GGCCATTTGT GTGGACCAAT CTACTCGGGG AATTCCAGGC CCACCAGGAC ACGTGCCAAT 1140 

GGCCCCATTC AGATGGCAAG GGAGGAGGTG TTCTTGAAGA CAGGAGGAGG CTCCCGCTGT 1200 

40 TAATAAATAT TGTTTCATTC TTCTCTCTTC CTGTCACCTT CTGCCAAGAC ATTGATGGCT 1260 

TCTGACATCT TATTTGGTGT CTCAAAGCTG TATTTCCAAG ACAGTGGTAC AAGGTGACXTC 1320 

TTAATTACCC GTATCATGGT TCTTGACCAG CACATTCAAT CCTCCAACCT ACCCTACTGC 1380 

45 

CATGACCTTC CGCACATCTC TAAGTTTTAT CTTTGCAATA CTCAAGGTTC TCGGAAATTT 1440 

GCTAATGGTT GTGATAAACC ATACAGCTTG AGCCAGTGAG GCAGATTGGG CTGGTGCCTT 1500 

50 CX5TCTGAGTT TTCCTGCTTT CCTGCCTCGT GCAGATTCTG AGGTATATCT GCTGCCTTGG 1560 

AAGACATAAG AAGCAGTGAT ACTCCCTGGC TCGGTTATTT TCTCCATACA ATGCACACAT 1620 

GGTACAATGA TAGAAGGCAA AATTGCCACT GTCTTCTTTT TTTTCTCATA TATCTAAGGA 1680 

55 

AGATATATCA GGTTGTGCCT CATGTACCGC TTCTAGTGAA ATGTAGAGGA AGGCTCAAAG 1740 

GAGTCAACAT TTAGATCTGG AAGGGACAAG TCATGCCTTG GGCCTAGAAT ACCCTGATGA 1800 

60 GAAAAGAGAA GAGGAAGGGA GGCCATATCT ACAACANCAN CCTCTCGGCA CTGCTGCTCC 1860 
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TTATTTTAAC TTTGTCTTCC ATTCTCCTGT ATTTATCACA GTTTCTGTTG AACAGCTTTT 1920 

CAAGTATTTG GGGAGTTTAT CTTGCCATCC TCCCCTTCTG GTTCTCTGCA CCCACCTCTC 1980 

CCACTGCAGT TCCTTCCGTG CTCTGrGACT TTAAGAGAAG AAGOGGGGAG GGGTCCXX3GA 2040 

TTTTATGTTT Grr T Cri ' l T l ' f TCTCCTTAGC ACTAGGACTT GATATTTTCA ATTTTGGAAG 2100 

AACTAAAAGA TGAATAAACT GQGTTTTTTT TGTTGTTTGT TTTTGTAAAA AAAAAAAAAA 2X60 

AAAAAAAAAA AAAAAAAAAA AAAAAAAATU^ AAAAAA 2196 



15 



20 
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(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
GCACAGAGTC GQGCGGACGG ACAGGGAGAG GAGGAGAGGG GGTCTGCGCG CGGCCGCTAC 
CCAGAAGCCA GCGGAOGGCA GCACGGAGTG GGCTGTCCCC GAGCCCAGCC CCGAGCGAGC 
CCCCCCCCCG CCCCC04AGG ACGCGCCTYC CAGCCAGCCC GACTYCTAGG AGGAGGGGAG 
GCGGGAAAGC AGCTCAAGCC TCACCCACCG CCCTGCCCCC AGCCCCGCCA CTCCCAGGCT 
CCTCGGGACT CGGCGGGTCC TCCTGGGAGT CTCGGAGGGG ACCGGCTGTG CAGACGCCAT 



GGAGTTGGTG CTGGTCTTCC TCTGCAGCCT GCTGGCCCCC ATGGTCCTGG CCAGTGCAGC 
TGAAAAGGAG AAGGAAATGG ACCCTTTTCA TTATGATTAC CAGACCCTGA GGATTGQGGG 
ACTGGTGTTC GCTGTGGTCC TCTTCTCGGT TGGGATCCTC CTTATCCTAA GTCGCAGGTG 
CAAGTGCAGT TTCAATCAGA AGCCCCGGGC CCCAGGAGAT GAGGAAGCCC AGGTGGAGAA 
CCTCATCACC GCCAATGCAA CAGAGCCCCA GAAAGCAGAG AACTGAAGTG CAGCCATCAG 
GTGGAAGCCT CTGGAACCTG AGGCGGCTGC TTGAACCTTT GGATGCAAAT GTCGATGCTT 
AAGAAAACCG GCCACTTCAG CAACAGCCCT TTCCCCAGGA GAAGCCAAGA AdTGTGrrGT 
CCCCCACCCT ATCCCCTCTA ACACCATTCC TGCACCTGAT GATGCAACTA ACACTTGCCT 
CCCCACTGCA GCCTGCGGTC CTGCCCACCT CCCGTGATGT GTGTGTGTGT GTGTGTGTGT 
GTGACTGTGT GTGTTTGCTA ACTGTGGTCT TTGTGGCTAC TTGTTTGTGG ATGGTATTGT 
GTTrGTTAGT GAACTGTGGA CTCGCTTTCC CAGGCAGGGG CTGAGCCACA TGGCCATCTG 
CTCCTCCCTG CCCCCGTGGC CCTCCATCAC CTTCTGCTCC TAGGAGGCTG CTTGTTGCCC 



60 
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GAGACCAGCC CCCTCCCCTG ATTTAGGGAT 
CAGTCGTCTT GGGACCTGGG AAGGTTTGCA 
5 TTCACTCCTT TAACAAAAAC CTTGCTTCCT 
TTAGCAACTG GAGATACAAA GCAAGGAGCT 
ATGCCCTTCC GTGGTOAATT TCTTCCCAGG 

10 

ccxxnrcACA gagcgcccgg ggattccagg 

GTGTCCCCTG CATATCTTCT CAGCAATAAC 
15 ACCTTCCCTG CTTCTGAGAC TTCAATCTAC 
CCCTGCAATT GGGTCTCTQG CAGGCAATAG 
ACACCGGGAT GGATGGAGGG AGAGCAGAGG 

20 

TGGGCAGCAG AGGCAACTCC CGCATCCTTT 
CGAGGTGGGT TGGAGACTCA GCAGGCTCCG 
25 GGTCATAACG AGAGTGGGAA CTCAACCCAG 
CGGAAACCAA CCAAACCGTG CGCTGTGACC 
TCAACAACAA CAGAAAAAAG GAATAAAATA 

30 

AAAAAAAAAA AAAAAAAAAA CTCGA 



391 

GCGTAGGGTA AGAGCACGGG CAGTGGTCTT 1080 

GCACTTTGTC ATCATTCTTC ATGGACTCCT 1140 

TATCCCACCT GATCCCAGTC TGAAGGTCTC 1200 

GGTGAGCCCA GCGTTGACX3T CAGGCAGGCT 1260 

GGCTTCCACG AGGAGTCCCr ATCTGCCCCG 1320 

CCCAGGGCTT CTACTCTGCC CCTGGGGAAT 1380 

TCCATGGGCT CTGGGACCCT ACCCCTTCCA 1440 

AGCCCAGCTC ATCCAGATGC AGACTACAGT " 1500 

TTGAAGGACT CXrrGTTCCGT TGGQGCCAGC 1560 

CCTTTGCTTC TCTGCCTACG TCCCCTTAGA 1620 

GCTCTGCCTG TCRGTGGTCA GAGCGGTGAG 1680 

TGCAGCCCTT GGGAACAGTG AGAGGTTGAA 1740 

ATCCXTGCCCC TCCTGTCCTC TGTGTTCCCG 1800 

CATTGCTGTT CTCTGTATCG TGATCTATCC 1860 

TCXrrTTGTTT CXrTAGTGAAA AAAAAAAAAA 1920 

1945 



35 

(2) INFORMATION FOR SEQ ID NO: 165: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 2933 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEE3NESS ; double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

45 

GGGTCGACCC ACGCGTCCGG CAGCCGTCGT TTGAGTCGTT GCTGCCGCTG CCCCCTCCCG 60 

GATCAGGAGC CAGTGTATAC CGCCCGCCCA CCGCCTTGGT GCCGCTAGAG GAAACGAGAA 120 

50 GGAGGCCGCC TGCGGTTTGT CGCCGCAGCT CGCCCMCYGY CYGGRAGAGC CGAGCCCCGG 180 

CCCAGTCGGT CGCYTGCCAC CSCTCGTAGC CGTTACCCGC GGGCCGCCAC AGCGGCCGGC 240 

CGGGAGAGGC GCGCGCCATG GCYTCTGGAG CCGATTCAAA AGGTGATGAC CTATCAACAG 300 

55 

CCATTCTCAA ACAGAAGAAC CGTCCCAATC GGTTAATTGT TGATGAAGCC ATCAATGAGG 360 

ACAACAGTGT GGTGTCCTTG TCCCAGCCCA AGATGGATGA ATTGCAGTTG TTCCGAGGTG 420 

60 ACACAGTGTT GCTGAAAGGA AAGAAGAGAC GAGAAGCTGT TTGCATCGTC CTTTCTGATG 480 
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ATACTTGTTC TGATGAGAAG ATTCGGATGA ATAGAGTTGT TCGGAATAAC CTTCGTGrTAC 540 

GCCTAGGGGA TGTCATCAGC ATCCAGOCAT GCCCTGATC5T GAAGTACGGC AAACGTATCC 600 

5 

ATGTGCTGCC CATTGATGAC ACAGTGGAAG GCATTACTGG TAATCTCTTC GAGCTATACC 660 

TTAAGCCGTA CTTCCTGGAA GCGTATCGAC CCATCCGGAA AGGAGACATT TTTCrTOTCC 720 

10 GTGGTGGGAT GCGTGCTGTG GAGTTCAAAG TGGTGGAAAC AGATCCTAGC CCTTATTQCA 780 

TTGTTGCTCC AGACACAGTG ATCCACTGCX; AAGGGGAGCC TATCAAACGA GAGGATGAGG 840 

AAGAGTCCTT GAATGAAGTA GGGTATGATG ACATTGGTGG CTGCAGGAAG CAGCTAGCTC 900 

15 

AGATAAAGGA GATGGTGGAA CTGCCCCTGA GACATCCTGC CCTCTTTAAG GCAATTGGTG 960 

TGAAGCCTCC TAGAGGAATC CTGCTTTACG GACXTIXXTOG AACAGGAAAG ACXXTTGATTG 1020 

20 CTCGAGCTGT AGCAAATGAG ACTGGAGCCT TCTTCTTCTT GATCAATGGT CCTGAGATCA 1080 

TGAGCAAATT GGCTGGTGAG TCTGAGAGCA ACCTTCGTAA AGCCTTTGAG GAGGCTGAGA .1140 

AGAATGCTCC TGCCATCATC TTCATTGATG AGCTAGATGC CATCGCTCCX: AAAAGAGAGA 120O 

25 

AAACTCATGG CGAGGTGGAG CGGCGCATTG TATCACAGTT GTTGACXXrrC ATGGATGGCC 1260 

TAAAGCAGAG GGCACATGTG ATTGTTATQG CAGCAACCAA CAGACCCAAC AGCATTGACC 1320 

30 CAGCTCTACG GCGATTTGGT CGCTTTGACA GGGAGGTAGA TATTC3GAATT CCTGATGCTA 1380 

CAGGACGCTT AGAGATTCTT CAGATCCATA CCAAGAACAT GAAGCTQGCA GATGATGTGG 1440 

ACCTGGAACA GTAGCCAATG AGACTCACGG GCATGTGGGT GCTGACTTAG CAGCXrCTGTG 1500 

35 

CTCAGAGGCT GCTCTGCAAG CCATCCGCAA GAAGATGGAT CTCATTGACC TAGAGGATGA 

GACCATTGAT GCCGAGGTCA TGAACTCTCT AGCAGTTACT ATGGATGACT TCCGGTGGGC 1620 

40 CTTGAGCCAG AGTAACCCAT CAGCACTGCG GGAAACCGTG GTAGAGGTGC CACAGGTAAC 1680 

CTGGGAAGAC ATCGGGGGCC TAGAGGATGT CAAACGTGAG CTACAGGAGC TGGTCCAGTA 1740 

TCCTGTGGAG CACCXIAGACA AATTCCTGAA GTTTGGCATG ACACXTTTCCA AGGGAGTTCT 1800 

45 

GTTCTATGGA CCTCCTGGCT GTGGGAAAAC TTTGTTGGCC AAAGCCATTG CTAATGAATG 1860 

CCAGGCCAAC TTCATCTCCA TCAAGGGTCC TGAGCTGCTC ACCATGTGGT TTGGGGAGTC 1920 

50 TGAGGCCAAT GTCAGAGAAA TdTTGACAA GGCCCGCCAA GCTGCCCCCT GTGTGCTATT 1980 

CTTTGATGAG CTGGATTCGA TTGCCAAGGC TCGTCGAGGT AACATTGGAG ATGGTGC3TGG 2040 

GGCTGCTGAC CGAGTCATCA ACXAGATCCT GACAGAAATG GATGGCATGT CCACAAAAAA 2100 

55 

AAATGTGTTC ATCATTGGCG CTACCAACCG GCCTGACATC ATTGATCCTG CCATCCTCAG 2160 

ACCTGGCCGT CTTGATCAGC TCATCTACAT CXXTACTTCCT GATGAGAAGT CCCGTGTTGC 2220 

60 CATCCTCAAG GCTAACCTGC GCAAGTCCCC AGTTGCCAAG GATGTGGACT TGGAGTTCCT 2280 
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GGCTAAAATG ACTAATGGCT TCTCTGGAGC TGACCTGACA GAGATTTGCC AGCGTGCTTG 2340 

CAAGCTGGCC ATCCGTCAAT CCATCGAGAG TGAGATTAGG CGAGAACGAG AGAQGCAGAC 2400 

5 

AAACCCATCA GCCATGGAGG TAGAAGAGGA TGATCCAGTG CCTGAGATCC GTCGAGATCA 2460 

CTTTGAAGAA GCXATGCXSCT TTGCGCGCCG TTCTGTCAGT GACAATGACA TTCGGAAGTA 2520 

10 TGAGATGTTT GCCCAGACCC TTCAGCAGAG TCGGGGCTTT GCSCAGCTTCA GATTCCCTTC 2580 

AGC3GAACCAG GGTGGAGCTG GCXTCXIAGTCA GGGCAGTGGA GGCGGCACAG GTGGCAGTGT 2640 

ATACACAGAA GACAATGATC ATGACCTCTA TGGCTAAGTG GTGGTCGCCA GCGTCCAGTG 2700 

15 

AGCTGGCCTG CCTGGACXrTT GTTCCrTQGG GGTTGGGGGCXS CTTGCXXAGG AGAGGGACCA 2760 

GQGGTGCGCC CACAGCCTGC TCCATTCTCC AGTCTGAACA GTTCAGCTAC AGTCTGACTC 2820 

20 TGGACAGGGG GTTTCTGTTG CAAAAATACA AAACAAAAGC GATAAAATAA AAGCGATTTT 2880 

CATTTGGTAA AAAAAAAAAA AAAAAAAAAT CXX3GGGGGGG GCCOGAACCA TTT 2933 



25 



(2) INFORMATION FOR SBQ ID NO: 166: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2243 base pairs 

(B) TV1>E: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

-TCGGAGAGCC- GGCGGGCGNG-CGCCTGTGGG--GGAQGAAGGG-GCTGTTQGAG-GGGT^ -60- 

GATGCCCAGA AGTGGCCTTG GGCTGGGGAT CACCATAGCT TTTCTAGCTA CGCTGATCAC 120 

40 

GCAGTTTCTC GTGTATAATG GTGTCTATCA GTATACATCC CCAGATTTCC TCTATATTCG 180 

TTCTTGGCTC CCTTGTATAT TTTTCTCAGG AGGCGTCACG GTGGGGAACA TAGGACGACA 240 

45 GTTAGCTATG GGTGTTCCTG AAAAGCCCCA TAGTGATTGA GTCTTCAAAA CCACCGATTC 300 

TGAGAGCAAG GAAGATTTTG GAAGAAAATC TGACTGTGGA TTATGACAAA GATTATCTTT 360 

TTTCTTAAGT AATCTATTTA GATCGGGCTG ACTGTACAAA TGACTCCTGG AAAAAACTCT 420 

50 

TCACCTAGTC TAGAATAGGG AGGTGGAGAA TGATGACTTA CCCTGAAGTC TTCCCTTGAC 480 

TGCCCGCACT GGCGCCTGTC TGTGCCCTGG AGCATTCTGC CCAGGCTACG TGGGTTCAGG 540 

55 CAGGTGGCAG CTTCCCAAGT ATTCGATTTC ATTCATGTGA TTAAAACAAG TTGCCATATT 600 

TCAAAGCCTT GAACTAAGAC TCAATTACCA ACCCGCAGTT TTGTGTCAGT GCCCAAAGGA 660 

GGTAGGTTGA TGGTGCITAA CAAACATGAA GTATGGTGTA ATAGGAATAA TATTTATCCA 720 

60 
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AAAGATTTTT AAAAATAGGG CTGTGTTTAA AAAAAAAAAC AAAACARGAA AAGCAGCAGT 780 

GATTATAGAG AGGTCACACT CTAAGTQGGG TCGCGGCGTG GCCACGCTTC ACGGTCACGC 840 

TCGTCTGTCC TGCAGTGGCG TGITTACATG GTCACACX3TG TGTGTATCAC CAGTGGGTCA 900 

ACTGCTTGTC ATTCCTCCCG TGGCAGTTTG TGTAGACAAT CTTACTGAGC AAAAGGCAAT 960 

GAAAAGTCTT GGTTCCCACA CTGCGATATA TTGGAATTTT CACCTCAGTT TATGAAGTTT 1020 

ATTTCGAAAT CXM'AGTCAT CTAAGAATGA ATACCTGTCT GCCATGTATT TCAATCTTAG 1080 

TGAGCXAAAA TTGTTTGTTT GTTACTACAG AATAGAGATG ACTGTTTTTT GCCACAGCCC 1140 
15 TATGGRATIT GCAATCTGTG ATTGCCTTGT AAAAAGGAGA GTGCATATGG CACTGCATTA ' 1200 

AACGTGTGGT GTTTCTAGTC AATGATATTG GTGAGCACAA TGTATTCATT TAATGGCATA 1260 

GACCATACCA GACCTAATTT GCAAGTATTG GGTCTTAAAC TTCAAGTGCA ATGTATATGA 1320 

20 

AAACCAATCT GAGCCTTGTA TCTCTTAAAT ATTTATTTTT TTTAACGTGT GAGATGTTCG 1380 

AGAGAAGGTT CTCCATTCAT TTCAGTGCTG CCTGGAGGAA ACTCGGCAAT GATTTCTTTC 1440 

25 AGTTGTGAAG TTCCTTTCGT OTTACACXXTT CXIACTGAACC CTCAACCTTC GAAATACTCC 1500 

AGTTTTGTGG GTTTGGTCAT TTTTACTTAT AAATTTACCT TTTTGTATTT TGCAATTTAC 1560 

ATGTGTTTGG TTTGmTAA ATTCTGTGAA AGTGGCTTGA TTAAAAGACT CCTTTTAAAT 1620 

30 

GGAAGCCACC AGTCAGCAGA ATGGAAGCTT AGAGGAACTT GCCTGTGAGC GCTGGTCTTT 1680 

GTGTTTGGTT TTGTGATGTA ACGATdTTG CTGGGG1TTT TTGCTTTGTT TTGAGGGAAA 1740 

35 TGTCTTQGAG TAAATTTTAA GTTCCTGGAG TTAATTTGTT TTACAGGAAT TTTGTTrrTT 1800 



40 



50 



55 



AAAAAAATAG GATCATTCTG AACTTTGGAA TGACCCCXTTT ATATATTTTC TGAAAATGAA 1860 

AACAGTTACA TGAAAAAAAT TTCCAATGAA GATGTCAGCA TTTTATGAAA AACCAGAAGT 1920 

TATTAGATGA AAGCAGCX3AG TGAATCTTTA AAACAGACTT GATCACGCAC ACACAATAAG 1980 

TCTTTCTCTC CGAAACCGGA AGTAAATCTA TATCTGTTAG AAATAATGTA GCCAAAAGAA 2040 

45 TCTAAATTTG AGGATTTTTT TQCCAATAGT TTATAGAAAA TATATGAACXT AAAGTGATTT 2100 

GAGTTTGTAA AAATGTAAAA TAGTATGAAC AAAATTTGCA CTCTACCAGA TTTGAACATC 2160 

TAGTQAQGTT CACATTCATA CTAAGTTTTC AACATTGTGT TCTTTTTGCA TTCATTTnT 2220 

ACTTTTATTA AAGCTTTCAAA ACC 2243 



(2) XNFORMATIC»J FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LQTOTH: 1816 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 167: 
GGTGOGNAGC TTTNAATTTC CCCTTACWGG GGCGCTOTAA GGGGAAACCT TCCCGGAATT 
TTCGGGTCGA CCCACGCGTC CGGCCAGCCT AGGAGAAGAA GTTCGTAGTC CCAGAGGTCA 
GGCAGGAGGC GGCAGTTTCT GGCGGGTGAG GGCGGAGCTG AAGTGACAGC GGAGGCGGAA 
GCAACGGTCG GTGGGGCGGA GAAGGGGGCT GGCCCCAGGA GGAGGAGGAA ACCCTTCCGA 
GAAAACAGCA ACAAGCTGAG CTGCTGTGAC AGAGGGGAAC AAGATGGCGG CGCCGAAGGG 
GAGCCTCTGG GTGAGGACCC AACTGGGGCT CCCGCCGCTG CTGCTGCTGA CCATGGCCTT 
GGCCGGAGGT TCGGGGACCG CTTCGGCTGA AGCATTTQAC TCGGTCTTGG GTGATACGGC 
GTCTTGCCAC CGGGCCTGTC AGTTGACCTA CCCCTTGCAC ACCTACCCTA AGGAAGAAGA 
GTTGTACGCA TGTCAGAGAG GTTGCAOGCT GTTTTCAATT TGTCAGTTTG TGGATGATGG 
AATTGACTTA AATCGAACTA AATTGGAATG TGAATCTGCA TGTACAGAAG CATATTCCCA 
ATCTGATGAG CAATATGCTT GCCATCTTGG KTGCCAGAAT CAGCTGCCAT TCGCTGAACT 
GAGACAAGAA CAACTTATGT CCCTGATGCC AAAAATGCAC CTACTCTTTC CTCTAACTCT 
GGTGAGGTCA TTCTQGAGTG ACATGATGGA CTCCGCACAG AGCTTCATAA CCTCTTCATG 
GACTTTTTAT CTTCAAGCCG ATGACGGAAA AATAGTTATA TTCCRGTCTA AGCCCAGRAA 
TCCCAGGTAC GCACCACATT TGGAGCCAGG AGCCCTACCA AATITGRGRG RAWCMTCTCT 
AAGCAAAATG TCCNTCAKMT CGSMAATGAG AAATTCACAA GCGCACAGGA ATTTTCrTGA 



AGATGGAGAA AGTGATGGCT TTTTAAGATG CCTCTCTCTT AACTCTGGGT GGATTTTAAC 
TACAACTCTT GTCCTCTCGG TGATGGTATT GCTTTGGATT TGTTGTGCAA CTTGTTGCTA 
CACGCTGTTG GACGCAGTAT AGTTTCCCTC TGAGAAGCTG AGTATCTATG GTGACTTGGA 
GTTTATGAAT GAACAAAAGC TAAACAGATA TCCAGCTTCT TCTCTTGTGG TTGTTAGATC 
TAAAACTGAA GATCATGAAG AAGCAGGGCC TCTACCTACA AAAGTGAATC TTGCTCATTC 
TGAAATTTAA GCATTTTTCT TTTAAAAGAC AAGTGTAATA GACATCTAAA ATTCCACTCC 
TCATAGAGCT TTTAAAATGG TTTCATTGGA TATAGGCCTT AAGAAATCAC TATAAAATGC 
AAATAAAGTT ACTCAAATCT GTGAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG 
GCCCGTTACC AAKTCGCCCT ATOGTGAETB GTATTMTTAT TTTACTAATA TCTGTAGCTA 
TTTTGTTTTT KGCTTKGGTT ATKGTTTTTY TCCCTTYTCT WAGCTATRAG CTGATCATKG 
CYSCTOCTCA CCTCCTGCCA TGATACTGTC AGTTACCTTA GTTAACAAGC TGAATATTTA 
GTAGAAATGA TGCTTCTGCT CAGGAATQGC CCACAAATCT GTAATTTGAA ATTTAGCAGG 



60 
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300 
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480 
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AAA.TGACCTT TAATGACACT ACATTTTCAG GAACTGAAAT CATTAAAATT TTATTTGAAT 1740 

AATTATGTGC TGAAAAAAAA AAAAAAAAAA AMWMRARASK RRWWACTCGA QGGGGGGCCC 1800 

GGTACCCNAT TCGCCXS 1816 

(2) INFORMATION FOR SEQ ID NO: 168: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

AGAAACCGTT GATQGGACTG AGAAACCAGA GTTAAAACCT CTTTGGAGCT TCTGAGGACT 60 

CAGCTGGAAC CAACGGGCAC AGTTGGCAAC ACCATCAACT TCTCCCAAGC AGAGAAACCC 120 

25 GAACCCACCA ACCAGGGGCA GGATAGCCTG AAGAAACATC TACACGCAGA AATCAAAGTCT 180 

ATTGGGACTA TCCAGATCTT GTGTGGCATG ATGGTATTGA GCTTOGGGAT CATTTTGGCA 240 

TCTGCTTCCT TCTCTCCAAA TTTTACCCAA GTGACTTCTA CACTGTTGAA CTCTGCTTAC 300 

30 

CCATTCATAG GACCCTTTTT TTTTATCATC TCTGGCTCTC TATCAATCGC CACAGAGAAA 360 

AGC3TTRACCA AGCTTTTGGT GCATAGCAGC CTGGTTGGAA GCATTCTGAG TGCTCTGTCT 420 

35 GCCCTGGTGG GTTTCATTAT CCTGTCTGTC AAACAGGCCA CCTTAAATCC TGCCTCACTG 480 



CAGTGTGAGT TGGACAAAAA TAATATACCA ACAAGAAC3TT ATGTTTCTTA CTTTTATCAT 540 

GATTCACTTT ATACCACGGA CTGCTATACA GCCAAAGCCA GTCTGGCTQG AWCTCTCTCT 600 

CTGATGCTGA TTTGCACTCT GCTGGAATTC TGCCTAGCTG TGCTCACTGC TGTGCTGCQG 660 

TGGAAACAGG CTTACTCTGA CTTCCCTGGG AGTGTACTTT TCCTGCCTCA CAGTTACATT 720 

45 GGTAATTCTG GCATGTCCTC AAAAATGACT CATGACTGTG GATATGAAGA ACTATTGACT 780 

TCTTAAGAAA AAAGGGAGAA ATATTAATCA GAAAGTTGAT TCTTATGATA ATATGGAAAA 840 

GTTAACCATT ATAGAAAAGC AAAGCTTGAG TTTCCTAAAT GTAAGCTTTT AAACTAATGA 900 

ACATTAAAAA AAACCATTAT TTCACTGTCA TTTAAAGATA ATGTG 945 



55 



(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEl^GTH: 902 base pairs 
60 (B) TYPE: nucleic acid 
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397 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

^ <xi) SEQUENCE DESCRIPTION: SBQ ID NO: 169: 

GGCAGAGCCA CAGGAAGGAT GAGGAAGACC AGGCTCTGGG GGCTGCTGTG GATGCTCTTT 60 

GTCTCAGAAC TCCGAGCTGC AACTAAATTA ACTGAGGAAA AGTATGAACT GAAAGAGGGG 120 

10 CAGACCCTGG ATGTGAAATG TGACTACACG CTAGAGAAGT TTGCCAGCAG CCAGAAAGCT 180 

TGGCAGATAA TAAGGGACGG AGAGATGCCC AAGACCCTGG CATGCACAGA GAGGCCTTCA 240 

AAGAATTCCC ATCCAGTCCA AGTGGGGAGG ATCATACTAG AAGACTACCA TGATCATGGT 300 

15 

TTACTGCGCG TCCGAATGGT CAACCTTCAA GTGGAAGATT CTGGACTCTA TCAGTOTGTG 360 

ATCTACCAGC CTCCCAAGGA GCCTCACATG CTGTTCGATC GCATCCGCTT GGTGGTGACC 420 

20 AAGGGTTTTT CAGGGACCCC TGGCTCCAAT GAGAATTCTA CCCAGAATGT GTATAAGATT 480 

CCTCCTACCA CCACTAAGGC CTTGTGCCCA CTCTATACCA GCCCCAGAAC TGTGACCCAA 540 

GCTCCACCCA AGTCAACTGC CGATGTCTCC ACTCCTGACT CTGAAATCAA CCTTACAAAT 600 

25 

GTGAGAGATA TCATCAGGGT TCCGGTGTTC AACATTGTCA TTCTCCTGGC TGGTGGATTC 660 

CTGAGTAAGA GCCTGGTCTT CTCTGTCCTG TTTGCTGTCA CGCTGAGGTC ATTTGTACCC 720 

30 TAGGCCCACG AACCCACGAG AATGTCCTCT GACTTCCAGC CACATCCATC TGGCAGTTGrr 780 

GCCAAGGGAG GAGGGAGGAG GTAAAAGGCA GGGAGTTAAT AACATGAATT AJ^ATCTGTAA 840 

TCACCRGCTA AAAAAAAAAA AAAAAAAACN CGANCCTNGG TTTTCAGCTC CATCAGCTCC 900 

TT 902 
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<2) INFORMATION FOR SEQ ID NO: 170: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1883 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

AGAAAACAAC TGAAAAACCA CATTTTTCTA CATACAGCTG GGGAGGTAGC TGAGAACTTG 60 

GCACTGCGCA CACATACTAG GTTGAAAGAG AGTTGAGGAA ACCAGAAGGC CAAGTGGATC 120 

55 TGCTGGCAAA CCCTGAACCT GTCTCCTGCG CTTGCTCTAC AGTTCTGAAG TTGAAAATCC 180 

TTTTCATGCC TAGCATCTGC TTGAGTTATA AACCCCAAGG CAGCCATGTC ATAGACTAGT 240 

GTTTACTCTT GTTTTGACTT TGTTTTAATG CTTCCTAAGA CCCAAGTGCC TCCTGCTGTT 300 

60 
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TCCTCCTTTG TGGTAGCCTC TGGCCATCTG GGACCTCAAT CCCCAGCTTT CCXrAdTTCA 
GCAGTCCTTT GCTCTCTTTG CTTCTACCTC AAATAGCCCC AGGAGTOGGC TTTAGTCTCC 
AATATGGAGC ATYTCAAGCT TCTCCTQGGG GATGGGGATT GGGATGGGCA GAATCTGTTT 
TGGWTCTCCG GGTTATTTCC AGTGGGTGTA AAAGCAGAGC TGGGCCTITC CCTCTCTTAT 
CXXTTGAGGGT GGGTAAGAAG GACTGTATCT ACACCTGTTC TTCCCTACCT TCTCTTTTGT 
TAGGGAGGCC TCATTCTAAG TTCCTCAAGA GAGTCCTTGG CTTAAAGCTG TAGCAAGGGT 
GTGCTAGGTG GGGGATTTQG AGCAAAACCG TCGAGTAGGC ATGATACTGG TATOGAGTOG 
GCCTGCAAAA TCAGACAGAA ATGGCTTGAG AAGCCGCAGG GGAGCATGCC TGTCTCTCAG 
TGATAGAGTA TGQGAGGGAC CTCCCTAGCT TGGAAAATGA GAATTGAAGG GGTTATGAAC 
AAATAGGATG CCTAGTTGAG GATGTTCCCA AAGTTTTGTC CAATCTTATC ATTAGTAGAT 
TTTATAAGCC ACAGAGACAA ACCAGAAACG GAATAATGTT ACTTTGGATG CTTTATTTTT 
TTGTTCTAGG TCTrGGCTTTG TACATGCAGA AGAATGCTAT ATGCTGCACA TTTTGCCTTT 
AAAGTCTTAC GACTTTCCCC ATTTTAGTCT AATGGGAAGA TACAGATGTG CAAGTCTGCT 
TTTTTGTTTT TTGTTATTAT TrTTTriTTT TIX3CTCTGTG TTATGGACAT TTTCAGACAT 
GCACAGAAGT GGAGAGGATG GTCCTTGGAC CCCATGTGTC CATCACCTAG CTGCATCACT 
TATCAGCTAT GGTCAACCTG GTTTCATCTG TATCTCTCTC TTTTCACCTG TATTGTTTAT 
TGAAAATCCA AGACACTATG CCAATGCAAC CGTGACTACT TTGQGAGATT GGTAGTCTCT 
TTTGATGGTG ATAGTGATGG GGTGCACTAT CATAATCACA TCAGGTCTGC TTTTTGCTTT 



TAATGTTAAC TAATGAAGTT CCAGAGATGG GCCTTAGAAA TGTGTTTTAA GAATTAACAA 
GGAGTCTCAA AAAGAAATGA GAGGGATGCT TCCTTTCCCC TTGCATCTAC AAAACAAGAG 
AGAGACTGTT CTGTTGTAAA ACTCTTTCAA AAATTCTGAT ATGGTAAGGT ACTTGAGACC 
CTTCACCAGA ATGTCAATCT TTTTTTCTGT GTAACATGGA AACTTGTGTG ACCATTAGCA 
TTGTTATCAG CTTGTACTGG TCTCATAACT CTGGTTTTGG AAGAATAATT TGGAAATTGT 
TGCTGTGTTC TGTGAAAATA ACCTCCCCAA AATAATTAGT AACTGGTTGT TCTACTTGGT 
AATTTGACAC CCTGTTAATA ACXSCAATTAT TTCTGTGTTC TTAAACAGTA TAAATAGTTG 
TAAGTTTGCA TGCATGATGG AAAAATAAAA ACCTGTATCT CTGTTAAT^ AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAA 
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(2) INFORMATION FOR SEQ ID NO: 171: 
(i) SEQUENCE CHAR?«rrERISTICS : 
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<A> LENGTH: 2100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 171: 
TACTTTTAGA TTTACTGCCT TCAAAAAGTG CCTATTCTGA GCAACATAAA CGTTATTCCT 60 
10 TACATATGTA TGTACACACG GTACCCAGAG TCGTACTGTG GCAGCCTTCA AAAACATACC 120 
ATCAGAAAGA GTAGGTGCTG AGATAAGOJA ACTTTGCCAA ATGNAAGAAA GTCACTCACT 180 
TCCAATATCC CCTCTTCAAG CGGCTACCGT C31AASGGGCT GCAAACACAT TCCCTGAGCA 240 

15 

TCCCTTGCTG ATACAGCTTC TTTATATTTA TATCCTACTG GATGGTAGCA TATTGCTAAG 300 
GTTTCCTGTA CTCTGCTTCA AGGGAATGTA AGYTTTATGG CATTGAAACA TTTAGGAAAA 360 
20 AAAAAGATGT TTAAGAGAAT TAATAGAGCC GTAGTCTGTA TTAGGATGTG TGTCATATGT 420 
GTGTTCTATA AACTAAGCAT CGGTGGGTTT AGAGTGTTAA AGTGTCAGCA CATTCCTTCT 480 
CCTTTTGTCT CTCAGGCTAA CATGAGAGAA AATAGAAAAG TCTTGGCTGT GGGGATTGGA 540 

25 

AGCTCAGGGG GCCAAATGTC CTTGCCAGAT CCTTAGAGCA TTACTTTGAC TCCTAAAAAT 600 
AGTAGTGTAT GTTATTTGAT GGCTTTTGTT TCCATAGTTC CATCACTGAC AAAACTGTCA 660 
30 ATACTGTTGA TQGAGCAGCA GCATAGCCTA GAGTGATGCA TTCTTACCCA GAGGTGQCAA 720 
TAGGAGAGGG TCCATGTAAA TAGGACGAGG TAGACAGTGC ATGATTGTAG GAGAAGGGTT 780 
GAAGGGAGGA CATGATTCCA AAAAAGATCG TTCTCAATGT GTCGTCTGAC TCAACCAGCT 840 
GGCAGATTAC ACTTGCCAAG TCGTTCCCTT TCCTTCTAAG TCAGTTGGCT CCATATTCAC 900 



35 



TTGAATATGC CTCTGTTTGG GCAAAGCAAG ATACCTCCAC TTAACCTTTA TCCAAGGAAG 960 

40 CTCTTGGTGT CCTCTTGGTC ATAAAGTTGT CTCCTACCTA ACCCAGTTTT ACCAAATGGA 1020 

AGTAAAAGGG GACAAACTAT GGAAGATGGA CTCCATGCCA TTGCAGTCAG CCACCATTCT 1080 

CTTTTCCATA TAAQGAGCCC CATTACATAA GCTACGGGTG AGGTTGGAAC AGCTATGTTT 1140 

45 

CATAATTTCA AGAGTGTGAC CACCCTGCTC TAGTCATCAT CATTGGATGA ATCCAGTTGA 1200 

CTCTTTGGCA AAAGGGTGAT ACTTTTCACT AAAAATGCCT ACTCTTCCTG TTGATGTTCC 1260 

50 TITTCTGTTt TTACCTTGTC CAATTTCCAC ACTAGTCATT TTTTTTATTT TTTAGAGGAT 1320 

CAGATTTTAG CGCTGGAAAA TGAGTTCAAA AATTTCAGTG TAATGTCATA AGGATGTTGG 1380 

GATACAGAGA TTTTTTTTTT CCTTGGAAAC AAATQGACTG GGAAGAAACA CAGCATGGCT 1440 

55 

TTGCTCTGAG TTTCAATCTG ATGATTATGA CCATGGAAGA TAGTCTTATG TAAAGGTTAA 1500 

ATGGTGTTTA CAAGTGGATA GATAAGGCGG AGATGGTGAG AAGCCGGGTT TTCTCTATGC 1560 

60 TAAATGTGTC TACTAAGAGC AGCACTTCCT ACTAGCTAAG CACAATCATA GCCCCACCGT 1620 
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GATGAGCTGC TAGTCTGAAT AACATTCCCT GACTTAGGGA AAGGCACACA AAAACATATA 1680 

AAGAATATGT CTATTTTCAT ATGTGTGATA CTGACAGAGC CATGGTATTC CTAAAATATA 1740 

5 

GGTTTCTCTT TTTTCTTCTA TTCTTAGCAA ATTGCATTTA TTCACTACAT TACAAACCAT 1800 

CACTGATGTA TCCAAAATAG CACACATAGT TCAGTATGAA AATAAGAGAA TAAAATCTGT 1860 

10 TATAAGCAAG TGATTTAGGT ATTTTCTTTT GTGTTTATGC ATTATCTGAC TATATTAAAA 1920 

CCTGTTTTTC TATTTACCTT CTATCAGTTT TCTCTACCAA TTATCTTTTT TCAATGCTCT 1980 

ATAAGAATGA ATATGGAAAT TATATTTCTT TTTTCTGTAA AAGACTTTCCA ACTACTTTAT 2040 

15 

TATATTTAGA AATCCAATAA ACTTCTTATT ACATTTAAAA AAAAAAAAAA AAAACTOGAA 2100 



20 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1930 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEE»JESS: double 

(D) TOPOliOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

30 

CCTTTGANTG TGGTCCCGGG TGCNGATTGG CAGCGCCTCC GCCGCGGCTC GTGGTTGTCC 60 
CGCCATGGCA CTGTCGCGGG GGCTGCCCCG GGAQCTGGCT GAGGCGGTGG CCGGGGGCCG 120 
35 GGTX3CTGGTG GTGGGGGCGG GCGGCATCGG CTGCGAGCTC CTCAAGAATC TCGTGCTCAC 180 



CGGTTTCTCC CACATCGACC TGATTGATCT GGATACTATT GATGTAAGCA ACCTCAACAG 240 

ACAGTTTTTG TTTCAAAAGA AACATGTTGG AAGATCAAAG GCACAGGTTG CCAAGGAAAG 300 

40 

TGTACTGCAG TTTTACCCGA AAGCTAATAT CGTTGCCTAC CATGACAGCA TCATGAACCC 360 

TGACTATAAT GTGGAATTTT TCCGACAGTT TATACTGGTT ATGAATGCTT TAGATAACAG 420 

45 AGCTGCCCGA AACCATGTTA ATAGAATGTG CCTGGCAGCT GATGTTCCTC TTATTGAAAG 480 

TGGAACAGCT GGGTATCTTG GACAAGTAAC TACTATCAAA AAGGGTCrTGA CCGAGTGTTA 540 

TGAGTGTCAT CCTAAGCCGA CCCAGAGAAC CTTTCCTGGC TGTACAATTC GTAACACACC 600 

50 

TTCAGAACCT ATACATTGCA TCGTTTGGGC AAACTACTTG TTCAACCAGT TGTTTGGGGA 660 

AGAAGATGCT GATCAAGAAG TATCTCCTGA CAGAGCTGAC CCTGAAGCTG CCTGGGAACC 720 

55 AACGGAAGCC GAAGCCAGAG CTAGAGCATC TAATGAAGAT GGTGACATTA AACGTATTTC 780 

TACTAAGGAA TGGGCTAAAT CAACTGGATA TGATCCAGTT AAACTTTTTA CCAAGCTTTT. 840 

TAAAGATGAC ATCAGGTATC TGTTGACAAT GGACAAACTA TGGCGGAAAA GGAAACCTCC 900 
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AGTTCCGTTG GACTGGGCTG AAGTACAAAG TCAAGGAGAA GAAACGAATC CATCAGATCA 960 

ACAGAATGAA CCCCAGTTAG GCCTGAAAGA CX:AGCAGGTT CTAGATGTAA AC5AGCTATGC 1020 

ACGTCTTTTT TCAAAGAGCA TCGAGACTTT GAGAGTTCAT TTAGCAGAAA AGGGGGATOG 1080 

AGCTGAGCTC ATATGGGATA AGGATGACCC ATCTGCAATG GATTTTGTCA CCTCTCCTCC 1140 

AAACCTCAGG ATGCATATTT TCAGTATGAA TATGAAGAGT AGATTTGATA TCAAATCAAT 1200 

GGCAGGGAAC ATTATTCCTG CTATTGCTAC TACTAATGCA GTAATTGCTG GGTTCATAGT 1260 

ATTGGAAGGA TTGAAGATTT TATCAGGAAA AATAGACCAG TGCAGAACAA TTTTTTTOAA 1320 

TAAACAACCA AACCCAAGAA AGAAGCTTCT TCTGCXriTCT GCACTGGATC CTCCTAACCC' 1380 

CAATTGTTAT GTATGTGCCA GCAAGCXIAGA GGTGACTGTG CGGCTGAATO TCCATAAAGT 1440 

GACTGTTCTC ACCTTACAAG ACAAGATAGT GAAAGAAAAA TTTGCTATGG TAGCACCAGA 1500 

TGTCCAAATT GAAGATGGGA AAGGAACAAT CCTAATATCT TCCX3AAGAGG GAGAGACGGA 1560 

AGCTAATAAT CACAAGAAGT TGTCAGAATT TGGAATTAGA AATGGCAGCC GGCTTCAAGC 1620 

25 AGATGACTTC CTCCAGGACT ATACTTTATT GATCAACATC CTTCATAGTO AAGACCTAGG 1680 

AAAGGACGTT GAATTTGAAG TTGTTGGTGA TGCCCCGGAA AAAGTCGGGS CCAAACAAGC 1740 

TGAAGATGCT GCCAAAAGCA TAACCAATGG GCAGTGATGA TGGGAGCTTC AGCCCTCCAC 1800 

CTYCACAGCT TCAAGGAGGC AAGATGGACG TYTCYCATAG TTCATYCGGR TGAAGAAGRT 1860 

TCTCCAATAA TTGCCCGACG TTCATTGAAG GAAGGAGGAG GAGGCCCGCC AAGAGGGGAA 1920 

35 TTTAGGWTTG 1930 



10 



15 



20 



40 (2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1509 base pairs 
(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

50 GGCCCTGGCC TCTGGGCTGA GGCTTGCTAG GGACTCGGGG TGGCTCTAAG GGGCAGGGAT 60 

AGGGCTGGGG AGCGCCGGCC TGTGGCCCTG ACCAGCCCCT TCTCGTGCRG GTTCCACCCC 120 

GATGCAGGTG GTCACGTGCT TGACGCGGGA CAGCTACCTG ACGCACTGCT TCCTCCAGCA 180 

CCTCATGGTC GTGCTGTCCT CTCTGGAACG CACGCCCTCG CCGGAGCCTX3 TTGACAAGGA 240 

CTTCTACTCC GAGTTTGGGA ACAAGACCAC AGGGAAGATG GAGAACTACG AGCTCATCCA 300 

60 CTCTAGTCGC GTCAAGTTTA CCTACCCCAG TGAGGAGGAG ATTGGGGACC TGACGTTCAC 360 
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TGTCGCCCAA AAGATGGCTG AGCCAGAGAA OQCCCCACCC CTCAGCATCC TCCTGTACGT 
GCAGGCCTTC CAGGTGGGCA TGCCACCCCC TGGGTGCTGC AGGGGCCCCX: TGCGCCCCAA 
GACACTCCTG CTCACCAGCT CCGAGATCTT CCTCCTGGAT GAGGACTGTG TCCACTACCC 
ACTGCCCGAG TTTGCCAAAG AGCXX3CCGCA GAGAGACAGG TACCGGCTGG ACGATGGCCG 
CCX3CGTCCXX5 GACCTGGACX: GAGTGCTCAT GGGCTACCAG ACCTACCCGC AGCCCTCACX: 
CTCGTCTTCG ATGACGTGCA AGGTCAOXSAC CTCATGGGCA GTGTCACCCT GGACCACTTT 
GGGGAGGTGC CAGGTGGCXrC GGCTAGAGCC AGCCAGGGCC GTGAAGTCCA GTGGCAGGTG 
TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG CTCATCTCGC TGTTQGCTCG CCAGTCGGAG 
GCCCTGTGTG GCCGTGAGCT GCCTGTCGAG CTCACCGGCT AGCXXTlGGCC ACAGCCAGCC 
TGTCGTGTCC AGCCTGACGC CTACTGGGGC AGGGCAGCAG GCTTTTGTGT TCTCTAAAAA 
TCnTTATCC TCCCTTTGGT ACCTTAATTT GACTGTCCTC QCAGAGAATG TGAACATGTG 
TGTGTGTTGT GTTAATTCTT TCTCATGTTG GGAGTGAGAA TGCCGGGCCC CTCAGGGCTG 
TCGGTGTGCT GTCAGCCTCC CACAGGTGGT ACAGCCGTGC ACACCAGTGT CGTGTCTGCT 
GTTGTGGGAC CGTTGTTAAC ACGTGACACT GTGGGTCTGA CTTTCTCTTC TACACGTCCT 
TTCCTGAAGT GTCGAGTCCA GTCCTTTGTT GCTGTTGCTG TTGCTGTTGC TGTTGCTGTT 
GGCATCTTGC TGCTAATCCT GAGGCTGGTA GCAGAATGCA CATTGGAAGC TCCCACCCCA 
TATTGTTCTT CAAAGTGGAG GTCTCCCCTG ATCCAGACAA GTGGGAGAGC CCGTGGGGGC 
AGGGGACCTG GAGCTGCCAG CACCAAGCGT GATTCCTGCT QCXTrGTATTC TCTATTCCAA 



TAAAGCAGAG TTTGACACCG TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA ATTNCTGCGG 
CCTCAAGGG 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
JL440_ 
1500 
1509 



45 (2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 174: 
55 TCGACCCCAS GCGTCCGTGC TTTTCCACAG AAGGTTAGAC CCTGAAAGAG ATGGCTCAGC 60 
ACCACCTATG GATCTTGCTC CTTTGCCTGC AAACCTGGCC GGAAGCAGCT GGAAAAGACT 120 
CAGAAATCTT CACAGTGAAT GGGATTCTGG GAGAGTCAGT CACTTTCCCT GTAAATATCC 180 

60 
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AAGAACCACG GCAAGTTAAA ATCATTGCTT GGACTTCTAA. AACATCTGTT GCTTATCTAA 240 

CACCAGGAGA CTCAGAAACA GCACCCGTAG TTACTGTGAC CCACAGAAAT TATTATCAAC 300 

GGATACATGC CTTAGGTCCG AACTACAATC TGGTCATTAG CGATCTGAGG ATGGAAGACG 360 

CAGGAGACTA CAAAGCAGAC ATAAATACAC AGGCTGATCC CTACACCACX: ACCAAGCGCTT 420 

ACAACCTGCA AATCTATCGT CGGeTTGGGA AACCAAAAAT TACACAGAGTr TTAATGGCAT 480 

CTGTGAACAG CACCTGTAAT GTCACACTGA CATGCTCTGT AGAGAAAGAA GAAAAGAATC 540 

TGACATACAA TTGGAGTCCC CTGGGAGAAG AGGGTAATGT CCTTCAAATC TTCCAGACTC 600 

CTGAGGACCA AGAGCTGACT TACACGTGTA CAGCXX^AGAA CCCTGTCAGC AACAATTCTC 660 

ACTCCATCTC TGCCCGGCAG CTCTGTGCAG ACATCGCAAT GGGCTTCCGT ACTCACCACA 720 

CCGGGTTGCT GAGCXnCCTG GCTATGTTCT TTCTGCTTGT TCTCATTCTG TCTICAGTGT 780 

TTTTGTTCCG TTTGTTCAAG AGAAGACAAG ATGCTGCCTC AAAGAAAACC ATATACACAT 840 

ATATCATGGC TTCAAGGAAC ACCXAGCCAG CAGAGTCCAG AATCTATGAT GAAATCCTCC 900 

AGTCCAAGGT GCTTCCCTCC AAGGAAGAGC CAGTGAACAC AGTTTATTCC GAAGTCCAGT 960 

TTGCTGATAA GATGGGGAAA GCCAGCACAC AGGACAGTAA ACCTCCTGGG ACTTCAAGCT 1020 

ATGAAATTGT GATCTAGGCT GCTGGGCTGA ATTCTCCCTC TGGAAACTGA GTTACAACCA 1080 

CCAATACTGG CAGGTTCCCT GGATCCAGAT CTTCTCTGCX: CAACTCTTAC TGGGAGATTC 1140 

CAAACTGCCA CATCTCAGCC TGTAAGCAAA GCAQGAAACC TTCTGCTGGG CATAGCnOT 1200 

GCCTAAATGG ACAAATGGAT GCATACCXTTT CCTGAAATGA CTCCCTTCTG AATGAATCAC 1260 

^AAAGCAGGTT- ACCTAGTATA -GTTTIOXAA-^ftCTTCTTCCC ^TraTAGCAC"^TOTAGAAAA 1320" 

TAATATTTTT ATGGCACACT GGGATAAACA AGCAAGATTG CTCACTTCTG GAAGCTCCAT 1380 

ATGACTAGAG GCCTCTTGTG ACTGGAGGTA ACAACCCTGC CCAGTAACTC TCGGAGAAGG 1440 

GGATCAATAT TTTGCACACC TGTAATAGGC CATGGCACAC CAGCCAAGAT GCTCTGCTCA 1500 

CAGTCAGTAT GTGTGAAGAT CCXrTGGTGCG TGGCXTTCAC CACGCATCTT GAGCAAATTA 1560 

GGAAAATGTA CCCTTCGCTT GAGGCAGATG CAGCCCITCC CCCGAGTGCA TCGCTTGGAG 1620 

AGCAGAATGT GGGCTGCATA TAAGCACACT CATCCCTTTG TCTGGGAATC TTTGTCCAGG 1680 

GCATAACAGG CTTAGTAAGT CCAAACACAG ATGACAGTGC TGTGTGGGTC TCTCTCAGAG 1740 

TTGTGGCTCT CAGCCATCTA GACACACTCT CCAAATGGAG TGTTGGAAAA TGTTCTTTCT 1800 

GCAGGGTCTA GAGACTGCTG GGACACTTTT CTTGGAGTGC TACTTCAGAA GCCITATAGG 1860 

ATTTTCTTTC TGGCCAAGAT TTCCTTCTGT ATCACTCCAA GCAGCCICAG CAGAAGAAGC 1920 

AGCCATGCCC AGTATTCCCA CTCTCCAAAA GGAACTGACC AGCTTATATT TCTCACACTT 1980 



SDODD: <WO 9a3S44aA2 l > 



wo 98/39448 



PCT/US98/04493 



10 



404 



CTGGGGAACT GGGTATAATC CAACXIATCAA AATAGAAGAC CTTGCAAGAA GCAGAGTCAT 2040 

TCTCCAGAAG GAACTTGGGA GATGATGGTG CAGATGATGA AACTGGGTTC ATCCCAGTTC 2100 

CAAAGACTCA GAGAACTAGA GTTTAAGCTG AGGCAGAGTG (XGCCACCCT GGCATGCCCC 2160 

ACAAACAGAT CACCAGCCAG CTTACACAGG CATTAACTCT CCTCAATGAG GAAGAATCAT 2220 

TCACAACTGA GCAAGACATT CATATGATCA TTTAAGGAAG TGTTTCCCTT ATGTGTTAGC 2280 

AAGTATAATC GGCTAACTCC TAAATCCCAA TGAATAGTCC TAGGCTGGAC AGCAATGGQC 2340 

TGCAATTAGG CAGATAAAGA CATCAGTCCC AGTAAATGAA TCCATAGACT CATCTAGCAC 2400 
15 CAACTACCAT TAGCACTATG TTAGGAGCTG CAAGGCCCCA AAGTAGAAGA TGTGCATAAT " 2460 

GTCTGCTCTT GTGTAGCTCA GGAGACAATT CCAGCACAGA CACTACAGTT AACGCTGAAC 2520 

TGCAGCTGCA AGTAATAGCA TGAACAGTCA GAAAAATACC TTATGAGGGG GCAGGGCTGA 2580 

20 

AGCTGGGCXrr TGAAGGATGG ATGAAATTTG GATAGAGAAT GAGGAAGACA GAGGGCCTCC 2640 

AAGTGAGAGA AGCATGAAAA ATGAGCAGGG GCCTGGATCA GTGGGGTGTA TTCAGAGCAC 2700 

25 CTCTCCAGAT GCACCATGCA TGCTCACAGT CCCTTGCCTA TGTGTGGCAG AGTGTCCCAG 2760 

CCAGATGTGT GCCCCCACCC CATGTCCATT TACATGTCCT TCAATGCCXyV CCTCAAAAGG 2820 

TACCTCTTCT GTAAAGCTTT CCCTGGTATC AGGAATCAAA ATTAATCAGG GATCTTTTCA 2880 

30 

CACTGOTGTT TTTTCCTCTT TGGTCCTTCT ATCACTAAAA CTCATCTCAT TCAGCCTTAC 2940 

AGCATAACTA ATrATTTGTT TTCCTCACTA CATTGTACAT GTGGGAATTA CAGATAAACX5 3000 

35 GAAGCCKGCT GGGGTGGTGG CTCACXSCCTG TAATCCCAAC ACTTTGQGAG GCCAAGGCAG 3060 

GCGGATCACC TGAGGTCAGG ARTTCGAGAT TARTCTGGCC AACATGGTGA AACXXTCATNT 3120 

NTACTAAAAA TACGAAATTA GCCAGGTGTG GTGGCACACA TCTGTAGTCC CAG 3173 



40 



(2) INFORMATION FOR SEQ ID NO: 175; 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATTCGGCA CAGCTGAGAG GAGACACAAG GAGCAGCCCG CAAGCACCAA GTGAGAGGCA 60 

55 

TGAAGTTACA GTGTGTTTCC CTTTGGCTCC TGGGTACAAT ACTGATATTG TGCTCAGTAG 120 

ACAACCACGG TCTCAGGAGA TGTCTGATTT CCACAGACAT GCACCATATA GAAGAGAGTT 180 

60 TCCAAGAAAT CAAAAGAGCC ATCCAAGCTA AGGACACCTT CCCAAATGTC ACTATCCTGT 240 
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CCACATTGGA GACTCTGCAG ATCATTAAGC CTTTAGATGT GTGCTGCGTG ACCAAGAACC 300 

^ TCCTGGCGTT CTACGTOGAC AGGGTGTTCA AGGATCATCA GGAGCCAAAC CCXIAAAATCT 360 

TGAGAAAAAT CAGCAGCATT GCXAACTCTT TCCTCTACAT GCAGAAAACT CTCCGGCAAT 420 

GTCAGGAACA GAGGCAGTGT CACTGCAGGC AGGAAGCCAC CAATGCCACC AGAGTCATCC 480 

10 ATGACAACTA TGATCAGCTG GAGGTCCACG CTGCTGCCAT TAAATCCCTG GGAGAGCTCG 540 

ACGTCTTTCT AGCCTGGATT AATAAGAATC ATGAAGTAAT GTCCTCAGCT TCATCACAAG 600 

GAACCTGTAT AGTGATCCAG GGATGAACAC CCCTTGTGCG GTTTACTGTG GGAGACAGCC 660 

CACCTTGAAG GGGAAGGAGA TGGGGAAGGC CCCTTGCAGC TGAAAGTCCC ACTGGCTOGC 720 

CTCAGGCTGT CTTATTCCX3C TTGAAAATAG CXAAAAAGTC TACTGTGGTA TTTGTAATAA 780 

20 ACTCTATCTG CTGAAAGGGC CTGCAGGCCA TCCTGGGAGT AAAQGGCTGC CTTCCCATCT 840 

AATTTATTGT GAAGTCATAT AGTCCATGTC TGTGATGTGA GCCAAGTGAT ATCCTGTAGT 900 

ACACATTGTA CTGAGTGGTT TTTCTGAATA AATTCCATAT TTTACCTAAA AAAAAAAAAA 960 

AAAAACTCGA GGGGGGGCCC GTACCCAATT T 991 



25 



30 



40 



(2) TNFORMATIC»J FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
_TOPebeGY^-3rinear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

ACAGCCCTCT TCGGAGOCTG AGCCCGGCTC TCCTCACTCA CCTCAACCCC CAGGCGGCCC 60 

CTCCACAGGG CCCCTCTCCT GCCTGGACGG CTCTGCTGGT CTCCCCGTCC CCTOGAGAAG 120 

45 AACAAGGCCA TGGGTCGGCC CCTGCTGCTG CCCCTRCTGC YCCTGCTGCW GCCGCCAGCA 180 

TTTCTGCAGC CTRGTGGCTC CACAGGATCT GGTCCAAGCT ACCTTTATGG GGTCACTCAA 240 

CCAAAACACC TCTCAGCCTC CATGGGTGGC TCTGTGGAAA TCCCCTTCTC CTTCTATTAC 300 

50 

CCCTGGGAGT TAGCCAYAGY TCCCRACGTG AGAATATCCT GGAGACGGGG CCACTTCCAC- 360 

GGGCAGTCCT TCTACAGCAC AAGGCCGCCT TCCATTCACA AGGATTATGT GAACCGGCTC 420 

55 TTTCTGAACT GGACAGAGGG TCAGGAGAGC GGCTTCCTCA GGATCTCAAA CCTGCGGAAG 480 

GAGGACCAGT CTGTGTATTT CTGCCGAGTC GAGCTGGACA CCCGGAGATC AGGGAGGCAG 540 

CAGTTGCAGT CCATCAAGGG GACCAAACTC ACCATCACCC AGGCTGTCAC AACCACCACC 600 

60 
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ACCTGGAGGC CCAGCAGCAC AACCACCATA GCCGGCCTCA GGGTCACAGA AAGCAAAGGG 660 

CACTCAGAAT CATGGCACCT AAGTCTGGAC ACTGCCATCA GGGTTGCATT GGCTGTOGCT 720 

GTGCTCAAAA CTCTCATTTT GGGACTGCTG TGCCTCCTCC TCTGTGGTGG AGGAGAAGGA 780 

AAGGTAGCAG GGCGCCAAGC AGTGACTTCT GACCAACAGA GTGTGGGGAG AAGGGATGTG 840 

TATTAGCCCC GGAGGACGTG ATGTGAGACC CGCTTGTGAG TCCTCCACAC TCGTTCCCCA 900 

TTGGCAAGAT ACATGGAGAG CACCCTGAGG ACCTTTAAAA GGCAAAGCCG CAAGGCAGAA 960 

GGAGGCTGGG TCCCTGAATC ACCGACTOGA GGAGAGTTAC CTACAAGAGC CTTCATCCAG 1020 

GAGCATCCAC ACTGCAATGA TATAGGAATG AGGTCTGAAC TCCACTGAAT TAAACCACTG 1080 

GCATTTGGGG GCTGTTYATT ATAGCAGTGC AAAGAGTTCC TTTATCCTCC CX:AAGGATGG 1140 

AAAATACAAT TTATTTTGCT TACCATACAC CX:CTTTTCTC CTCGTCCACA TTTTCCAATC 1200 

TGTATGGTGG CTGTCTTCTA TGGCAGAAGG TTTTGGGGAA TAAATAGCGT GANAT<»rrNC 1260 

TGACTOAAAA AAAAAAAAAA AAAAACTCGA 1290 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(x i) SEQUENCE DESCRIPTION: SEQ ID NO : 177: 



TGGGGCCCCT TTTGGATGCT CTGGGTGTTT TTGCCAAGAG TTACAGGATG TCAAGTGTGG 60 

GGAGCTCAGC ACCCTTGCTG TGGACCAGTG AAGGCTGTTC CAGACCAGGT GCTTCCAGAC 120 

ATTTCCAGGC TCCAGGAGAG AGGCTGGGAG CCCCCACAGA AAGCACAGGA AAATGCAAAA 180 

AAAAAACAGT CTTTTTTTTT TTTTTGCTTT TTATTATGAA AACAAAACAA ATGCCCCAGG 240 

AGAAGGGTCC ATGATTACCA GAAACATCAA AGAGTACTTT CTACCATTTT TATTCTGTTG 300 

TGTTGAGGCC AGCATTGCAA TAAACAAGCT AAACTACTTA CATTGGACTC ATTTTCAGTA 360 

ACTGACATTT ACAGGAATAT ACTAGAAACG GCACTAAAAA GTTTAAGAAA AGTTACGGTA 420 

AACTTGCATG CACATCATAC AGAAAAGTAA CATTTTAAAT ATAAAAAAGA AAAACTTCCT 480 

GGAAGCATTA TGCCAGTATT AAGGAACAGT GCTACTCTGG ATGTGACAAA TTCTGTATGT 540 

GGGTGTTACT CTTTCCCAAA AGACICTCAG AGGCGTGAGT GCTGCAAAAG AACAACAACA 600 

AAAACAAACA CACAAAAAAA TGTGTCTTAC AGTTTGTAAG CAAGATGACA CTGCCCAACA 660 

CAAAGAGGGG TCTGGAGTTC AGTTCACGCC CGAAGCCTGC CCCCTCGGCC TCCAGGGGTC 720 
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ATTCAGAGTG TTCTCAAATC CAATTCOGAC ACACGACTTG TCACTACTCC TCTCCCCTTG 
AAAAAAGCAT GTTAGAAGCT GCCCTACAGG TCTCAGCAGT GGGACAATCT AATTGAATCA 
CCGCAGCCTT CTAATACAGA AGAAACGGAC GTGACTGTCA CCCTCAQCCC GCCAGCAAQG 
GOGCTGAGGA AGTCATTAAT CXTTTCGAAAC TCTGAAAAGA AACCAGTGTT GAAGTCTGGA 
CAGAAAGCCT TAAAAAAGTG ACAGCACCAA TGCAGCTGCT CAGTGTACXX: NCCX3TGGGCT 
GTCAQGGTCA GTGGCTTCTT TCTAGATGAA AGGAGCAGAG GCGAGCCGAC GCXIACCXTTCA 
CAGAGAACCA GCCGAGAAGG AAAGGCCCCA CGATGCTCCC TGTGCGCTGC CXXXACAGCC 
GGCCX3CTCCC CCGACGGCTC ACACAGGCAG CACCTCACTG CCCTGTGGCT GGAGGGGCAT 
TGCAAGGAGC GCCCCCCAGC CCCAGGCACC CXTCGGCTTAG GGTGTAOGTA TCACCXy\GCC 
CTGTGCTGGC AGCACX5TTAC CAACCAGCCT GCGTGAAGAC CTGTCAACTG TCGTGTGrTGA 
ATTCCTTAAA TTCGGTTTAA ATAGTCCATT AAAGATCTGT TTAGAAAATA CCTTTGAAAA 
CGAGGGTAAC TTTAAAAAAT GGAAACTTTC AAATCCATTT ATATTTTTAT TATAAACAAA 
ACTTAATTAA AAGTTTAACA AACTGGCTGA AAACTCACCA AGTGTCAGAC TCACX::AGCAA 
TTTAAAAAAT GATAATTTAC CAGCATCTCC TCATCAGAGT TCCCTCTCCA GTAAGGGTAT 
ACCTACATCT GTAAGGGTCA GTGGACTCTG AATCAATTTT ATGGTTGTTT TAAAATCACC 
GTGTATTAGG ATACTAATGA TAGTCCCTAT ATCCATCCT^ AT^TGCTGGC AGAAAGCACT 
GGCCACCATA CAGGACAGAC CACACCACAG CTCCATACCC AGCGTCTGCC TGGAGGCTCC 
CCCACGCTGA GGTCCGGGAG AATGCCTGGT TTCAGTCATT TCCGGACTAA CTGTGACAAC 



GCGTGAGCAG GGAGCACCGT GCGAGTCTCC GGGAGGGAAT CCTCCTGGGG CX:CAGAGACT 

crrccAcccc tggggagggc agacaggctc gggarggcct ggccaggcca ctggaggctg 

GCAGGGAGCA QGCATGTCCA CCCGCAAGCC TGGGAGGCTA ACTCTGGCAT TCCTGGCCGG 
AGCCGCCATG CTCATTQGTG GGCCAGTTTG GGACATCXXrC GTACTCAAAG ACCATATGGC 
AGCCTCTGGG AAAACAAAAC CAAAACATCA CCTTCTATTA AACTCTGTAT ATTATTATTT 
TTTACAATAG AAAGTTAAAA ATCAAGACTT AGATTTACTA TACATTTrrr CTCTCAGATT 
ACAAAGTTTA TATTATATAA CTGGGGTTCC CTAAATTGAT TTCTTTTAAA ACAGTCTTAA 
AGAGACCAGA AGTGAATACA AAAGAACTAA ACAAAATAAA AAATTAGAAT GTGCTGTAGC 
TGAAAGCTGT 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2290 
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(2) INFORMATION FOR SEQ ID NO: 178: 



i^KX^D: <WO_983944aA^_> 
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(i) SEQUENCE CHARACTERISTXCS : 

(A) LQJGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : douiole 

(D) TOPOIiOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
GGCACGAGCC ATGCCTGGCC TCTCCTTGAT TCTTACAGTC ACTTTGTTGG CTGTTTCTCA 
CTCAGCAGCT ACCTGCATTG TGGCCAAAGG ATGACCTATT CCTTCTCAGG AGGGCAAAAA 
TGTGGAATAG TGTCTGTCCA TGCCTCTCCT CATGGGCTAC CACCTCTGCC ACCGTCGTTA 
ATCAGTAACA ACCAGGAGAG AAGCTGCTGG AACTGACCTC TGGGAACTCC CTGGGATOGT 
TTGGTGCAGG AATGTAGTAG GCATACACGT GGTTGCGTGG ATCTGGGCCC TCCTGATCTC 
AGTAGAGAGG TAAAAGGCCA CCATCTCCTT GACCTCTGGG GAACTCATCC ACAAAGAAGA 
TGTTTCCAAG ATGCTTCTGA AGATTGCCTA AAAATAGCCG GTTTCCACCC CCGTGAATGC 
ATCCATTCTA GAATGCTCCT TCACCAGGAC CAGAGAACTG ATTTACAGAA GTGACATGAA 
AACATTCCAT CCCAGAATTT GCAGTAGCTC AAATTAAGTT TCTAGCTATT AAAAAGAAAA 
AAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 179: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 1509 base pairs 
(B) TYPE: nucle ic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
GGCACGAGGG CTCATTCATT CCGCGCCGGG CCTGCCAGAC ACCTGCGCCC TTCTGCAGCC 
GCCCGCCGCA TCCGCCGCCG CAGCCCCCAG CATGTCGGGC CCAGACGTCG AGACGCCGTC 
CGCCATCCAG ATCTGCCGGA TCATGCGGCC AGATGATGCC AACGTGGCCG GCAATGTCCA 
CGGGGGGACC ATCCTGAAGA TGATCGAGGA GGCAGGCGCC ATCATCAGCA CCCGGCATTG 
CAACAGCCAG AACGGGGAGC GCTGTGTGGC CGCCCTGGCT CGTGTCGAGC GCACCGACTT 
CCTGTCTCCC ATGTGCATCG GTGAGGTGGC GCATOTCAGC GCGGAGATCA CCTACACCTC 
CAAGCACTCT GTGGAGGTGC AGGTCAACGT GATGTCCGAA AACATCCTCA CAGGTGCCAA 
AAAGCTGACC AATAAGGCCA CCCTGTQGTA TGTGCCCCTG TCGCTGAAGA ATGTCGACAA 
GGTCCTCGAG GTGCCTCCTG TTGTGTATTC CCGGCANGAG CAGGAGGAGG AGGGCCGGAA 
GCGGTATGAA GCCCAGAAGC TGGAGCGCAT GGAGACCAAG TGGAGGAACG GGGACATCGT 



wo 98/39448 



409 



PCT/US98/04493 



CCAGCCAGTC CTCAACCXrAG AGCXXSAACAC TGTCAGCTAC AGCCAGTCCA GCTTOATCCA 660 

^ CCTGGTGGGG CCTTCAGACT GCACCCTGCA CXX5CTTTGTG CACGGAGGTG TCACCATCAA 720 

GCTCATGGAT GAGGTCGCCG GGATCGTGGC TGCACGCX^^ TGCAAGACCA ACATCGTCAC 780 

AGCTTCCGTG GACGCCATTA AOTTTCATGA CAAGATCAGA AAAGGCTGCG TCATCACCAT 840 

10 CTCGGGACGC ATGACCTTCA CGAGCAATAA GTCCATGGAG ATCGAQGTGT TGGTCGACGC 900 

CGACCCTGTT GTGGACAGCT CTCAGAAGCG CTACXX3GGCC GCCAGTGCCT TCTTCACCTA 960 

CC3TGTCGCTG AGCCAGGAAG GCAGGTCGCT GCCTGTGCCC CAGCTQGTGC CCX3AGACCGA 1020 

GGACGAGAAG AAGCGCTTTG AGGAAGGCAA AGGGCGGTAC CTGCAGATGA AGGCGAAGCR 1080 

ACAGGGCCAC GCGGASCYTC AGCCCTAGAC TCCCTCCTCC TGCCACTGGT GCCTCGAC3TA 1140 

20 GCCATGGCAA CGGGCCCAGT GTCCAGTCAC TTAGAAGTTC CCCCCTTGGC CAAAAACCCA 1200 

ATTCACATTG AGAGCTGGTG TTGTCTGAAG TTTTCGTATC ACAGTGTTAA CCTGTACTCT 1260 

CTCCTGCAAA CCTACACACC AAAGCTTTAT TTATATCATT CCAGTATCAA TGCTACACAG 1320 

25 

TGTTGTCCCG AGCGCCX3GGA GGCGTTGGGC AGAAACCCTC GGGAATGCTT CCX3AGCACGC 1380 

TGTAGGGTAT GGGAAGAACC CAGCACCACT AATAAAGCTG CTGCTTGGCT GGAAAAAAAA 1440 

30 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 

AGAAAAAAN 1509 
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AZ) _INEORMATION-^EOR--SEQ^ ID-NO :-i80 : 



(i) SEQUENCE CHARACTEFtlSTICS : 
40 (A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: ISO: 

AGCTGTATCA TAGGAAAGAT GGCCACACCG GCGGTACCAG TAAGTGCTCC TCCGGCCACG 60 

CCAACCCCAG TCCCGGCGGC GGCCCCAGCC TCAGTTCCAG CGCCAACGCC AGCACCGGCT 120 

50 

GCGGCTCCGG TTCCCGCTGC GGCTCCAGCC TGCATCCTCA GACCCTGCGG CAGCAGCGGC 180 

TGCAACTGCG GCTCCTGGCC AGACCCCGGC CTCAGCGCAA NTCCAGCGCA GACCCCAGCG 240 

55 CCCGCTCTGC CTGGTCCTGC TCTTCCAGGG CCCTTCCCCG GCGGCCGCGT GGTCAGGCTG 300 

CACCCAGTCA TTTTGGCCTC CATTGTGGAC AGCTACGAGA GACGCAACGA GGGTGCTGCC 360 

CGAGTTATCG GGACCCTGTT GGGAACTC3TC GACAAACACT CAGTQGAGGT CACCAATTGC 420 

60 
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TTTTCAGTGC CGCACAATGA GTCAGAAGAT 
AATATCTATG AACTGCATAA AAAAGTTrCT 
5 ACGGGCCATG ACATCACAGA GCACTCTGTG 
CCCCAACCCC ATCCACCTCA CTOTQGACAC 
AGCCTACGTC AGCACTTTAA TGGGAGTCCC 

10 

TCTGACAGTG AAATACGCGT ACTACGACAC 
GACCTGCTTT AGCCCCAACA GAGTGATTGG 
15 GGCATCAGCT CGCATCCAGG ATGCCCTGAG 
GTCTGGAAAG GTGTCAGCTG ACAATACTGT 
AGTACXXSAAA ATAGTTCCCG ATGACTTTGA 

20 

TTTGATGGTG ACCTACCTGG CCAACCTCAC 
TGTAAACCTG TGAATGGACC CCAAGCAGTA 
25 CTCAGAAGTG AAGGAGAAAT GGGTTTTTTG 
GTGTGTGACT CTAATAAACG GAGCCTACCT 
SGRGGGGGGG CCCGGTCCCA TTSSCCCTTT 

30 



GAAGTGGCTG TTGACATCGA ATTTGCTAAG 480 

CCAAATGAGC ICATCCTGGG CTGGTACGCT 540 

CTGNATCCAT GAGTACTACA GCCGAGAGGC 600 

AAGTCTCCAG AACQGCCGCA TGAGCATCAA 660 

TGGGAGGACC ATGGGAGTGA TGTTCACGCC 720 

TGAACGCATC GGAGTTGACC TGATCATGAA 780 

ACTCTCAAGT GACTTGCAGC AAGTAGGAGG 840 

TACAGTGTTG CAATATGCAG AGGATGTACT 900 

GGGCCGCTTC CTGATGAGCC TGGTTAACCA 960 

GACCATGCTC AACAGCAACA TCAATGACCT 1020 

ACAGTCACAG ATTGCACTCA ATGAAAAACT 1080 

CACTTGCTGG TCTAGGTATT AACCCCAGGA 1140 

TGGTCTTGAG TCACACTGAG ATAGTCAGTT 1200 

TTTGTAAATT AAAAAAAAAA AAAAAAACCN 1260 

NGTAATTCGT NTTACAATCC CCNGGC 1316 
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(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS: 

^—l^E^VHi 777 base pairs" 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

GGCATGWKCA GACATGACTT CTATTGCCAG GCTGGTCAAG TGGCAGGGTC ATGAGGGAGA 60 

CATCGATAAG GGTGCTCCTT ATGCTCCCTG CTCTGGAATC CACCAGCGQG CTATCTGCGT 120 

TTATGGGGCT GGGGACTAGA ATTGGATGCT TCAAAACCAT CACCTGTTGG CCAACAAGTT 180 

50 TGACCCAAAG GTAGATGATA ATGCTCTTCA GTGCTTAGAA GAATACCTAC GTTATAAGGG 240 

CCATTCTATT GGGACCTGAA CTTTGAAGAC CACAMTATTG AAGAGGCGTT GCTTACCYGT 300 

TGGGGGCCAA GAGGCATGTT ACCAAACATG GYYCARGAAM YTTGGYKGGG AMCARKKKKG 360 

GKKGGGARRM CMRGGGYTTG SCAAWTTCSK KGGCMWCCYT TTAGGGTAAR RRGGGCKGIW 420 

ATTAGATTGT GQGTAAAGTA GGATCTTTTG CCCTTGCAAA TTTGCTGCCT GGGTGAATGY 480 

60 TGCTTGTrCC TTCTCMACCC CTAACCCTAG TAGTTCCTCC ACTAACTTTC TCACTAAGTG 540 
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AGAATGAGAA CTGCTGTGAT AGGGAGAGTG AAGGAGGGAT ATGTGGTAGA GCACTTGATT 600 

TCAGTTGAAT GCXrTGCTGGT AGCTTTTCCA TTCTGTOGAG CTGCCGTTCC TAATAATTCC 660 

AGGnTGGTA GCGTGGAGGA GAACTTTGAT GGAAAGAGAA CCTTCCCTTC TCTACTGTTA 720 

ACTTAAAAAT AAATAGCTCC TGATTCAAAG TAAAAAAAAA AAAAAAAAAA AAAAAAA 777 



(2) INFORMATION FOR SEQ ID NO: 182: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 baise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
GGCACAGATA ACTATGTACA TGTATTCCTT AAATGTTTTT TTAAGTTTTA TATTCTTCGC 60 
25 ACTGGTCTTC AAATC3TGTAC ATGTGTGCCA GGGAGCAAAT GCCTTCTTCT TTCTGAAATT 120 
GGTdTTTAG ACTGTTCTTT TTTCCCATCT TCTCACCTCC TGCCCCTCCT TCAGGGTACT 180 
TCCGTGGCCA GAACCCCTCC AGGTCAGAGG CAGAAGAGAA GCCTCATGGG TCACAGCAGC 240 

30 

AGATGTGGGC TGGAGATCTA TTCATTTGGT TTTGGCTTGA ATTTTCTGRA TGGTTTACTT 300 
GATCYTGGGA AAGANATATC TTGCCAGGAA AAATGATAGN CCTTGACAAT GTTGAATGAT 360 
35 CCTGCACCAC CTTGAAAGAC ATTTCTAATA TGGTTTGTCA GGCAAAGTGG TTAGTAGTCA 420 

TTTGTGGCCT GAGGTAGAAG TCCTC AGAAA TCAGCAGArr TTACmATaa AATGCTGACT 480- 

TGCCCCTGGA CTGGGCTCTG TGAGAGTGGC CTTCTGCACT GTGCACAGTA GGTGTGAACA 540 

40 

CACCACACCT ACAGGGACCA CGTGGTGGGC TGTGGACTAG CGQCCAAGCT CCCTGCAGGC 600 
CCACTAATAG AATTCAGCTT TTAGCATGGG CTGTTTCATA CTGTTCTGAT GAAACTGATT 660 
45 TGGTTTCTTT CCTCCATACC CCTTCTGCAT TTCAGTGTTT TTGTTTAGTT TTCCTGGTTT 720 
TTAATTATAA CTACAAAATA AAATCTTTAG GCTATTCACC TTAGCTTAGT AAAAAAAAAA 780 
AAAAAAAACT C 791 

50 



55 



(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 
60 (D) TOPOLOGY: lineai: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

AAATTGATTA ACAGCTTGAA AGAAGGCTCT GGTTTTGAAG GCCTAGATAG CAGCACTGCC 60 

5 

AGTAGCATGG AGCTGGAAGA ACTTCGGCAT GAGAAAGAGA TCCAGAGGGA GGAAATACAG 120 

AAGCTGATGG GCCAGATACA TCAGCTCAGA 1CCGAATTAC AGGATATGGA GGCACAGCAA 180 

10 GTTAATGAAG CAGAATCT^ AAGAGAACAG TTACAGGV/TC TGCATGACCA AATAGCTGGG 240 

CAGAAAGCAT CCAAACAAGA ACTAGAGACA GAACTGGAGC GACTGAAGCA GGAGTTCCAC 300 

TATATAGAAG AAGATCTTTA TCGAACAAAG AACACATTGC AAAGCAGAAT TAAAGATCGA 360 

15 

GACGAAGAAA TTCAAAAACT CAGGAATCAG CTTACCAATA AAACTTTAAG CAATAGCAGT 420 

CAGTCTGAGT TAGAAAATCG ACTCXATCAG CTAACAGAGA CTCTCATCCA GAAACAGACC 480 

20 ATGCTGGAGA GTCICAGCAC AGAAAAGAAC TCCCTGGTCT TTCAACTGGA GCGCCTCGAA 540 

CAGCAGATGA ACTCCGCCTC TGGAAGTAGT AGTAATGGGT CTTCGATTAA TATG?rCTGGA 600 

ATTGACAATG GTGAAGGCAC TCGTCTGCGA AATGTTCCTG TTCTTTTTAA TGACACAGAA 660 

25 

ACTAATCTGG CAGGAATGTA CGGAAAAGTT CGCAAAGCTG CTAGTTCAAT TGATCAGTTT 720 

AGTATTCGCC TGGGAATrTT TCTCCGAAGA TACCCCATAG CGCX3AGTTTT TGTAATTATA 780 

30 TATATGGCTT TGCTTCACCT CTGGGTCATG ATTGTTCTGT TGACTTACAC ACCAGAAATG 840 

CACCACGACC AACCATATGG CAAATGAACC AAGCCCAGTT GTTGCAGTGA TrGGTTGTCT 900 

TTTTCTAGAC TTGGGATCTG CAAGAAGGCC AATTGCCTAA AATTTCTGAG AACAGTGCAC 960 

AAGATTATTT TATCACTACA AGCTTTTAAC TTTTTAAGTT ATTGTACAAG TATTCTACCT 1020 
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AAATCTTCCA ATTTCCTTTA AATGGTAAGA GTTTCTAAAA CAGACAATAA TTTTUICAAGC 1080 

40 TCAGCTCTGC TTTATCTGAG TTTAGTGGTC CTAATATATA TGTAGAGAAA GATGGTGGGG 1140 

TTGTTCACCT CTGTACAGAC CATCTGTATG TTAGGTGACA TTGATTATGG GTTATAATCA 1200 

GGGAAACTAA TTGTATTTAG TGACAAAAAT AAAAAGTTTT TTTTTTATAA TTCAGTCTGC 1260 

45 

TTTTGGATTT TCATATATTT AACTTTGCAA AAAGATTTAC TTTGTACATG TTACAGGCTT 1320 

GATTGGTGTA AATCTTITTA TAAATACATA AATAAAAGNA AAATATGCAT TTTTCTTTTC 1380 

50 TAAAAAAAAA AAAAAAAAAA CTCGA 1405 



55 (2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

GTCATGCAGT GCGCCGGAGA ACTGTGCTCT TTGAGGCCGA CGCTAGGGGC CCGGAAGGGA 60 

AACTGCGAGG CGAAGGTGAC CGGGGACCGA GCATTTCAGA TCTGCTCGGT AGAOCTGGrTG 120 

CACCACCACC ATGTTGGCTG CAAGGCTGGT GrTGTCTCCGG ACACTACdT CTAGGGTTTT 180 

CCACCCAGCT TTCACCAAGG CCTCCCCTGT TGTGAAGAAT TCCATCACGA AGAATCAATG 240 

GCTGTTAACA CCTAGCAGGG AATATGCCAC CAAAACAAGA ATTGGGATCC GGCGTGGGAG 300 

AACTGGCCAA GAACTCAAAG AGGCAGCATT GGAACCATCG ATGGAAAAAA TATTTAAAAT ' 360 

TGATCAGATG GGAAGATGGT TTGTTCCTGG AGGGGCTGCT GTTGGTCTTG GAGCATTGTG 420 

CTACTATGGC TTGGGACTGT CTAATGAGAT TGGAGCTATT GAAAAGGCTG TAATTTGGCC 480 

TCAGTATGTC AAGGATAGAA TTCATTCCAC CTATATGTAC TTAGCAGGGA GTATTGGTTT 540 

AACAGCTTTG TCTGCCATAG CAATCAGCAG AACGCCTGTT CTCATGAACT TCATGATGAG 600 

AGGCTCTTGG GTGACAATTG GTGTGACCTT TGCAGCCATG GTTGGAGCTG GAATGCTGGT 660 

ACGATCAATA CCATATGACC AGAGCCCAGG CCCAAAGCAT CTTGCTTGGT TGCTACATTC 720 

TGGTGTGATG GGTGCAGTGG TGGCTCCTCT GACAATATTA GGGQGTCCTC TTCTCATCAG 780 

AGCTGCATGG TACACAGCTG GCAITGTGGG AGGCCTCTCC ACTGTOGCCA TGTGTGCGCC 840 

CAGTGAAAAG TTTCTGAACA TQGGTGCACC CCTGGGAGTG GGCCTQGGTC TCGTCTTTGT 900 

GTCCTCATTG GGATCTATGT TTCTTCCACC TACCACCGTG GCTGGTGCCA CTCTTTACTC 960 

- AGTGGCAAIG-TACGGTQGA'P^ -TASTTCTTTT- GASGATCCTG-CTTGTGTATG^ATAeeeAGAA -102 0 - 

AGTAATCAAG CGTGCAGAAG TATCACCAAT GTATGGAGTT CAAAAATATG ATCCCATTAA 1080 

CTCGATGCTG AGTATCTACA TGGATACATT AAATATATTT ATGCGAGTTG CAACTATGCT 1140 

GGCAACTC3GA GGCAACAGAA AGAAATGAAG OXSACTCAGCT TCTGGCTTCT CTGCTACATC 1200 

AAATATCTTG TTTAATQGGG CAGATATGCA TTAAATAGTT TGTACAAGCA GCTTTCGTTG 1260 

AAGTTTAGAA GATAAGAAAC ATGTCATCAT ATTTAAATGT TCCGGTAATG TGATGCCTCA 1320 

GGTCTGCCTT TTTTTCTGGA GAATAAATGC AGTAATCCTC TCCCAAATAA GCACACACAT 1380 

TTTCAATTCT CATGTTTGAG TGATTTTAAA ATGTTTTGGT GAATGTGAAA ACTAAAGrTTT 1440 

GTGTCATGAG AATGTAAGTC TTTTTTCTAC TTTAAAATTT AGTAGGTTCA CTGAGTAACT 1500 

AAAATTTAGC AAACCTGTGT TTGCATATTT TTTKGGAGTG CAGMMTAWTG TAATTARAGC 1560 

ATTCCAGTAA NAGTGTNTTT AAAGTTGNTC TATATN 1596 



60 



SCXXaO: <WO_983944aA2_l_> 



wo 98/39448 



414 



PCT/US98/04493 



(2) INFORMATION FOR SEQ ID NO: 185: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»^ : SEQ ID NO: 185: 



GCGCAGAGCC CGYACGAGCA GGACGACGAC GACAAGGGCG ACTCCAAGGA AACGCGGCTG 60 

ACCCTGATGG AGGAAGTGCT CCTGCTGGGC CTCAAGGACC GCGARGGTTA CACATCATTT 120 

TGGAATGACT GTATATCATC TGGATTACGT GGCTC3TATGT TAATTGAATT AGCATTGAGA 180 

GGAAGGTTAC AACTAGAGGC TTGTGGAATG AGACGTAAAA GTCTATTAAC AAGAAAGGTA 240 

ATCTOTAAGT CAGATGCTCC AACAGGGGAT GTTCTTCTTG ATGAAGCTCT GAAGCATGTT 300 

AAGGAAACTC AGCCTCCAGA AACGGTCCAG AACTGGATTG AATTACTTAG TGGTGAGACA 360 

TGGAATCCAT TAAAATTGCA TTATCAGTTA AGAAATGTAC GGGAACGATT AGCTAAAAAC 420 

CTGGTGGAAA AGGGTGTATT GACAACAGAG AAACAGAACT TCCTACTTTT TGACATGACA 480 

ACACATCCCC TCACCAATAA CAACATTAAG CAGCGCCTCA TCAAGAAAGT ACAGGAAGCC 540 

GTTCTTGACA AATGGGTGAA TGACCCTCAC CGCATQGACA GGCGCTTGCT GGCCCTCATT 600 

TACCTGGCTC ATCXXTTCGGA CGTCCTGGAG AATGCTTTTG CTCCTCTTCT GGACGAGCAG 660 

TATGATTTGG CTACCAAGAG AGTGCGGCAG CTTCTCGACT TAGACCCTGA AGTGGAATGT 720 

C TGAAG G CCA AC ACCAATGA GGTTCTGTGG GCGGTGGTQG CGGCGTTCAC CAAGTAACTC 780 

TGCTCGGGGT GAACCATTCT CCTTTCTCTC AAGTAAACCA GTAGnTTTTC TTCTGTTGAC 840 

TTCTGGTTTT CTGTAATTTG TACTTTCCCA CACTATAATT GGCTTCTGTT TTACAAAATG 900 

GTGGGTGGCT TTTTCTTTTT TGTACGTGTA CAGGATTCTG CTGOTACGAG AGGCCTTCCT 960 

CTTTCTGTTT TTAAATJ^AAG TTTTACTGCC ATATTGGCAT TCCATTCCCT GTTGCCATCC 1020 

TCACTGTTAC CTGTTTTGGG TTTCTGGTCT ACTTTGACTT TCAAAGTACC TCCAGCCTCC 1080 

TCATACGCAC AGCTTTTGGA TGACCTCAGC TTGAGTTTCT CCATATGTGC ATGTACATCT 1140 

AGCATTCTGC CTACAGTTCA GACAGAAGTC ACAAAAAGGC CTTCAACTCA CCAAAGGTAA 1200 

ATATCTGTAT CTATTAGGAC ATTTTTTACA TAGACTTCAG TTGAGATGTA TACTTAGCAA 1260 

AATTATTTTT AAATTGAAAC AGCACAGTAA ATACTTAATA TAAAATGTCC CTTGGATTTT 1320 

GCTTCCCATG TAAATCTATT GTATTATTAC ACTTGTTATA ATTTTAACTA TAAAGGTCCA 1380 

ATTGTTTCAC AGAGCCAGTT TGGGATGGGC TGCATTCCAT TTATGCTGTA TATAGTTTGA 1440 

ATTATATATA AATTACCCCT TCTTCTQGCC ACCCCTGCTC OCATCTTAGT ATTTTGCAAG 1500 
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ATCTAATCAG TTCTACACCT GGTGCCCCTC GCTTGCTTCA ATCATGGTTA TTTGATOGCA 1560 

^ AAATCGACCT CTTGTCGCTG AAGGAGAGAG AAAAGATCTG TCTCTGATTG GTCCTGQGAT 1620 

TTTTTGAGCT GTCCCATTTA TQGTACTCTT TGCCTATGCA TCCCCTTTTT AGATTTTTTT 1680 

TAAATTTTAT CTTACTGTTT TTATAATTTC TATTGGGAAG AGGCTTGrTGA CCAGTACCAA 1740 

10 TCTTGAGTTT CTTrrTCTGT CCACAAGTAA ATTAATATCT GCTCTGAAAT GTCATTTATC 1800 

TACTCACACA TTCTTGGGGA AAAAAATCAA ATGTCAGTCC TAGCAGATGT TCCATCTAAA 1860 

TTGGTAGCAA GTAATGATTA CAACXTCAGAG GATTAAGAAT TTTGTAACAG AAAGCTCTAT 1920 

GTTTTAATTT TTTATATACA ATTAGGATAA TTAGCATTGT CAGACTATAA ACCTTTOCTT 1980 

TTTAAAGTTT ATTITTACTA TTTCTTTATC ACTTTATTGT ATCATCACCA TrGGTTTCAT 2040 

20 AATGTAAATA CTATATGTTG AACAAATTAA ATGTCAAAAT TTTTTATTAC CATAGTCCAT 2100 

GTTAATAGTG GGGCTTTCAG GTGTTTAGAG ATTTTTTTTG TTGTTGTTAA CATTCATTGC 2160 

AAAAGTACTA GATGGTGTAT AACTCTAGAG TTGAATTTTA AGGGATTCCC TAATATGTAT 2220 

25 

ACTATCTTTT TATCTGAAGT AATAAATAAA CAATGATCTT GAAAGTGCCY RAAAMAAAAA 2280 

AAAAAAAAAA AAA 2293 

30 

<2) INFORMATION FOR SEQ ID NO: 186: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 base pairs 
<B) TYPE: rmr^l^-ir* \ar-if^ 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GGCACGAGGC GAGCCGGCGC ACCGTACGCT GGGACGTX3TG GTTTCAGCTC GTGCGCCTCC 60 

45 CCGTGOGTTT GCGACGTTTA GCGACTATTG CGCCTGCGCC ACGCCGGCTG CGAGACTGGG 120 

GCCGTGGCTG CTGGTCCCGG GTGATGCTAG GCGGCTCCCT GGGCTCCAGG CTGTTGCGGG 180 

GTGTAGGTGG GAGTCACGGA CGGTTCGGGG CCCGAGGTGT CCGCGAAGGT GGCGCACATG 240 

50 

GGCGGCAGGG GAGAGCATGG CTCAGCGGAT GGTCTGGGTG GACCTGGAGA TGACAGGATT 300 

GGACATTGAG AT^GGACCAGA TTATTGAGAT GGCCTGTCTG ATAACTGACT CTGATCTCAA 360 

55 CATTTTGGCT GAAGGTCCTA ACCTGATTAT AAAACAACCA GATGAGTTGC TGGACAGCAT 420 

GTCAGATTGG TGTAAGGAGC ATCACGGGAA GTCTGGCCTT ACCAAGGCAG TGAAGGAGAG 480 

TACAATTACA TTGCAGCAGG CAGAGTATGA ATTTCTGTCC TTTGTACGAC AGCAGACTCC 540 
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TCCAGGGCTC TGTCCACTTG CAGGAAATTC AGTTCATGAA GATAAGAAGT TTCTTGACAA 600 

ATACATGCXrC CAGTTCATGA AACATCTTCA TTATAGAATA ATTGATXHXSA GCACTGTTAA 660 

AGAACTGTGC AGACXSCTGGT ATCCAGAAGA ATATGAATTT GCACCAAAGA AGGCTGCTTC 720 

TCATAGGGCA CTTGATGACA TTAGTGAAAG CATCAAAGAG CrTCAGTTTT ACCGAAATAA 780 

CATCTTCAAG AAAAAAATAG ATGAAAAGAA GAGGAAAATT ATAGAAAATG GGGAAAATGA 840 

GAAGACCGTG AGTTGATGCC AGTTATCATG CTGCCACTAC ATCGTTATCT QGAGGCAACT 900 

TCTGGTGGTT TTTTTTTCTC ACGCTGATGG CTTGGCAGAG CACCTTCGGT TAACTTGCAT 960 

CTCCAGATTG ATTACTCAAG CAGACAGCAC ACGAAATACT ATTTTTCTCX: TAATATGCTG " 1020 

TTTCCATTAT GACACAGCAG CTCCTTTGTA AGTACCAGGT CATGTCCATC CCTTCGTACA 1080 

TATATGCATT TGCTTTTAAA CCAITTCTTT TGTTTAAATA AATAAATAAG TAAATAAAGC 1140 

TAGTTCTATT .GAAATGCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1200 

AAAAAAAAAA AN 1212 
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(2) INFORMATION FOR SEQ ID NO: 187: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



GCTTCCGGAA GTTGCTTTTG TCCAAACATC CGGGCTTCTC 
CCCACCTCTC CTCGACCCTG GACGTCTACC TTCCGGAGGC 
GCGCGGGGCT AGCGCGGGTT TCAGCGACGG GAGCCCTCAA 
GCGCCGGCGG GCGGCGCCCG AAATGGAGCT GGCCCGGAAT 
ATCCAGGGCG GAGGCTCAGC TGTGATTGAC ATGGAGAACA 
AGCTTCGAGG ATATGGGTGA GCTGCATCAG CGCCTGCGCG 
GCAGCTGATG CAGCTGCTGC TGAAGAGGAG GATGGAGAGT 
AAGGGACAGC TGAGCCGGCA GGTGGCAGAT CAGATGTGGC 
TCCAGGGCCT TCAGCTTGTA CGCCAACATC GACATCCTCA 
CCTGCTCAGG TGCGAACAGG GCTCCTGGAG TCCATGATCC 
CCCCAGAAAA TTGCAGGTGA ACTCTATGGA CCTCTCATGC 
ATCCTACTCC ATGGGATGAA GACGTCTGAC ACTATTATCC 



CTTTTTGTGT TCCGGCCGAT 
CCACATCTTG CCCACTCCGC 
GGGACATGGC AACTACAGCG 
GGGGAGGGTT CGAAGAAAAC 
TGGATGATAC CTCAGGCTCT 
AGGAAGAAGT AGACGCTGAT 
TCCTGGGCAT GAAGGGCTTT 
AGGCTGQGAA AAGACAAGCC 
GACCCTACTT TGATGTGGAG 
CTATCAAGAT GGTCAACTTC 
TQGTCTTCAC TCTGGTTGCT 
GGGAGGGCAC CCTGATCGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ACAGCCATTG GCACCTGCTT CGGCTACTGG CIX3GGAGTCT CATO^TTCAT TTACTTX^^ 780 

^ GCCTACCTGT GCAACGCCCA GATCACCATG CTGCAGATGT TGGCACTGCT GGGCTATGGC 840 

CTCrrrGGGC ATTGCATTGr CCTGTTCATC ACCTATAATA TCCACCTCCA CGCCCTCTTC 900 

TACCTCTTCT GGCTGTTGGT GGGTGGACTG TCCACACTGC GCATOGTAGC AGTGTTGGTG 960 

10 TCTCGGACCX; TGGGCCCCAC ACAGCGGCTG CTCCTCTGTG GCACCCTGGC TGCCCTACAC 1020 

ATGCTCTTCC TGCTCTATCT GCATTTTCCC TACCACAAAG TGGTAGAGGG GATCCTOGAC 1080 

ACACTGGAGG GCCCCAACAT CCCGCCCATC CAGAGGGTCC CX::AGAGACAT CCCTCCCATC 1140 

CTCCCTGCTG CTCGGCTTCC CACCACCGTC CTCAACGCCA CAGCCAAAGC TGTTGCGGTG 1200 

ACCCTGCAGT CACACTGACC CXACCTGAAA TTCITGGCCA GTCCTCTTTC CCGCAGCTGC 1260 

20 AGAGAGGAGG AAGACTATTA AAGGACAGTC CTGATGACAT GTTTCGTAGA TGGGGTTTCC 1320 

AGCTGCCACT GAGCTGTAGC TGCGTAAGTA CXnCCTTGAT GCNTGTCGGC ACTTCTGAAA 1380 

GGCACAAGGC CAAGAACTCC TGGCX:AGGAC TGCAAGGCTC TGCAGCCAAT GCAGAAAATC 1440 

25 

GGTCAGCTCC TTTGAGAACC CCTCCCCACC TACCCCTTCC TTCCTCTTTA TCTCTCCCAC 1500 

ATTGTCTTGC TAAATATAGA CTTGGTAATT AAAATGTTGA TTGAAGTCTG GAAAAAAAAA 1560 

30 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAC ICGAG 1605 

35 (2) INFORMATION FOR SEQ ID NO: 188: 

liJ_SEQUS21CELj:mRACTERXSTICS^ 

(A) LENGTH: 1516 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : doxjble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

45 ATTCGGCATG AGGGGGTCAC GTGGTGGCTG GGCCGGGGAA ATGGCGGCTT CAGGAGAGAG 60 

CGGGACTTCA GGCGGCGGAG GCAGCACCGA GGAAGCATTT ATGACCTTCT ACAGTGAGGT 120 

GAAACAAATA GAGAAGAGAG ACTCGGTTCT AACTTCGAAA AATCAGATTG AAAGACTGAC 180 

50 

CCGTCCTGGT TCCTCTTACT TCAATTTGAA CCCATTTGAG GTTCTTCAGA TAGATCCTGA 240 

AGTTACAGAT GAAGAAATAA AAAAGAGGTT TCGGCAGTTA TCCATCTTGG TGCATCCTGA 300 

55 CAAAAATCAA GATGATGCTG ACAGAGCACA AAAGGCTTTT GAAGCTGrrCG ACAAAGCTTA 360 

CAAGTTGCTA CTGGATCAGG AGCAAAAGAA GAGGGCCCTG GATGTAATTC AGGCAGGAAA 420 

AGAATACGTG GAACACACTG TGAAAGAGCG AAAAAAACAA TTAAAGAAGG AAGGAAAACC 480 

60 
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TACAATTCTA GAGGAGGATG ATCCTGAGCT GTTCAAACAA GCTGTATATA AACAGACAAT 540 

GAAACTCTTT GCAGAGCTGG AAATTAAAAG GAAAGAGAGA GAAGCCAAAG AGATGCMGA 600 

AAGGAAACGA CAAAGGGAAG AAGAGATTGA AGCTCAAGAA AAAGCCAAAC GGGAAAGAGA 660 

GTGGCAGAAA AACTTTGAGG AAAGTCGAGA TGGTCGTGTG GACAGCTGGC GAAACTTCCA 720 

AGCCAATACG AAGGGGAAGA AAGAGAAGAA AAATCGGACC TTCXTCAGAC CACCGAAAGT 780 

AAAAATGGAG CAACGTGAGT GACCX5CCCAA GGTCACAGGC ACAGAACCTT TCCXXTTGCTA 840 

TCTCCTTTCC TGCTTCGAAG GACTCATTCT TTCCTCCXAC TTCCACCCCA ACATAGAGTA 900 

15 GTATTTGCnT TTTAGTCCAT TTTGTTTTCA ATACX3ATTTA ATATCGATCA GAGTAATTCT 960 

TTTGTACATT GAAATGAGGG GCTTGGTrTA AAAAAAGACXT TTTCCCTCTC CCTGCCCCTA 1020 

GAACAACCAG TATTAGAAGG TGCCACCATT GGTGCTGCCT TCTCTTCCCA CAGCCTGTAA 1080 

20 

CTCAGTGTTT TGTACTICAC TGAATTGTGA TGGTTAGAAA CTTCGTGGAT AGTTTGTOGA 1140 

AATCATCCAA TTAAACATAC TGCTTAAAAC AGTGTTGCTG TGACTTCAGA GACAAGCCTC 1200 

25 GAAGQGGCAC CTTAGGAAGC CCerTCGCTT GAGTTGCTCG CTTCTGGGOXS TGCTCXXTTTC 1260 

GAAGGCCCAG ATAAGACAGG GAACACTTGT GAGCACACAG AGCAGCATCT GATGCCCTGT 1320 

GGTGTTTGGC ATGTGCCCCC TGTCTACTGA CCAATCAGTG TGGCATGAGG CXXIACGCXAC 1380 

30 

CCAAACCTTT CACTTTCCAA AGAGCTAGCC GTCCTCCACC CAGTACCATG TCCTAGCCTG 1440 

TCTGCATTTG TTAGTGGTAA TATTCTTTAT GTATAATAAA TTTTTATACC CAAAAAAAAA 1500 

35 AAAAAAAAAA ACTCX3A 1516 



40 (2) INFXDRMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEM3TH: 681 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

50 GCTCCCATGT TGCTGGCTGT CCGTACATCA CCCTGTCCCC TGCAGGAGGG GGCTACAGGC 60 

CATCTCCCTC CTGTAGGCCT CTGACTCCCC TCCACTTTTG GGCCCTCAGC TTATCTCGGG 120 

CAGGGGACCA TTGCAGCATC CTCCCCTCCT CNGGACTCAA GGTQCTGAGG TATAAGCCCT 180 

55 

GGGCCCCAGA TCCCTGRTKA CACCTTCCTG GAGAAGACTC TCAAAAGTGA CTGTATATTT 240 

GAGTTCACCA GCAATAACTC CCCACACTCG AAGCAGGTCC AAACCCMAGG ATCCCAGGGT 300 

60 CCTTGGGCTC TGTOGCACTG TCTTCCCAAG ATCCTTCCTG TTGCACAATG GGAAACCTAA 360 
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GAGGAAAAAG ACAGGGGCCT GCTTGCCCAG CCATGCGAGG GATTCCATX3C CCACXTIXXXX: 420 

^ TCTGYCTGCC TCGCTQGAAT GTGGGCCCCT GCTCOXGTC AGGTTGTGCT GTCTCTCACC 480 

TATGrrTACA TCCCCGAGGG GTrTCTGCCT CCTCXXCACC CAGGTCAGGG TGTGGTCCAG 540 

CAGCTTGCTG TGGGGTGCTG ACATGTGTCA CCACTGCXXX: CCTTGCCCCC GGGGGGGTCA 600 

10 TGGTCTCCTC CTGGATGCTG CICCTTGAAT YTTTTTTYTT GAWAAACCYT TTAMAATTAA 660 

AAAAAAAAAA AAAAAACTCG A 681 

15 

(2) INFt)RMATIC»J FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERXSTICS : 
20 (A) IiENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

GCCTCAAGCC ACGCATATGA TAATTTTCTG GAACATTCAA ATTCAGTGTT TCTACAGCCA 60 

GTTAGTCTAC AAACCATTGC AGCAGCACCA TCAAACCAGA GTCTGCCACT TTITCTCATC 120 

30 

GCTGGATGAT TGCTGGGCAA AGGTGGCCTT TTAGAGCTCT TAAAAGCCCA CAAAAAGGCT 180 

ATTCGTAGAG CCACAGTCAA CACATTTGGT TATATTGCAA AGGCCATTGG CCTCATGATG 240 

35 TATTGGCTAC ACTTCTGAAC AACCTCAAAG TTCAAGAAAG GCAGAACAGA GTTTGTACCA 300 

CT&TAGCAAT„ AGCTATTGCT-jGCAGAAACAT--^^ -3 6^ 

TGAATGAATA CAGAGTTCCT GAACTGAATG TTCAAAATGG AGTGTTAAAA TCGCTTTCCT 420 

40 

TCTTGTTTGA ATATATTGGT GAAATGGGAA AAGACTACAT TTATGCCGTA ACACCGTTAC 480 

TTGAAGATGC TTTAATGGAT AGAGACCTTG TACACAGACA GACGGCTAGT GCAGTGGTAC 540 

45 AGCACATGTC ACTTGGQGTT TATGGATTTG GTTGTGAAGA TTCGCTGAAT CACTTGTTGA 600 

ACTATGTATG GCCCAATGTR TTTGAGACAT CTCCTCATGT AATTCAGGCA GTTATGGGAG 660 

CCCTAGAGGG CCTGAGAGTT GCTATTGGAC CATGTAGAAT GTTGCAATAT TGTTTACAGG 720 

50 

GTCTGTTTCA CCCAGCCCGG AAAGTCAGAG ATGTATATTG GAAAATTTAC AACTCCATCT 780 

ACATTGGTTC CCAGGACGCT CTCATAGCAC ATTACCCAAG AATCTACCAA CGATGATAAG 840 

55 RACACCTATA TTCGTTATGA ACTTGACTAT ATCTTATAAT TTTATTGTTW ATTTKGTGKT 900 

TAATGCACAS TACTTCACAC CTTAAACTTG CTTTGATTTG GTGATGTAAA CTTTTAAACA 960 

TTGCAGATCA GTGTAGGACT GGTCCATAGG GGAAGAGCTA GGAANTCCAT AGGC 1014 

60 
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(2) INFORMATION FOR SEQ ID NO: 191: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TYPEi nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TCGCAGCAGG GTGTGTCCAG ATGGTCAGTC TCTGC5TGGCT AGCCTGTCCT GACAGGGGAG 60 

AGTTAAGCTC CCGOTCTCCA CCGTCXTCGGC TGGCCAGGTG GGCTC5AGGGT GACCGAGAGA 120 

CCAGAACCTG CTTGCTGGAG CTTAGTGCTC AGAGCTGGGG AGGCiAGGTTC OGCCGCTCCT 180 

20 CTGCTGTCAG CGCCGGCAGC CCCTCCCGGC TTCACTTCCT CCCGCAGCCC CTGCTACTGA 240 

GAAGCTCCGG GATCCCAGCA GCCGCCACGC CCTGGCCTCA GCCTGCGGGG CTCCAGTCAG 300 

GCCAACACCG ACGCGCANTG GGAGGAAGAC AGGACCCTTG ACATCTCCAT CTGCACAGAG 360 

25 

GTCCTGGCTG GACCGAGCAG nCCTCCTCCTC CTAGGATGAC CTCACCCTCC AGCTCTCCAG 420 

TTTTCAGGTT GGAGACATTA GATGGAGGCC AAGAAGATGG CTCTGAGGCG GACAGAGGAA 480 

30 AGCTGGATTT TGGGAGCGGG CTGCCTCCCA TGGAGTCACA GTTCCAGGGC GAGGACCGGA 540 

AATTCGCCCC TCAGATAAGA GTCAACCTCA ACTACCGAAA GGGAACAGGT GCCAGTCAGC 600 

CGGATCCAAA CCGATTTGAC CGAGATCGGC TCTTCAATGC GGTCTCCCGG GGTGTCCCCG 660 

35 

AGG ATCTGGC TGGACTTCCA GAGTACCTGA GCAAGACCAG CAAGTArOTn Anrr:AOT<"r3G 320- 

AATACACAGA GGGCTCCACA GGTAAGACGT GCCTGATGAA GGCTGTGCTG AACCTTAAGG 780 

40 ACGGGGTCAA TGCCTGCATT CTGCCACTGC TGCAGATCGA CCGGGACTCT GGCAATCCTC 840 

AGCCCCTGGT AAATGCCCAG TGCACAGATG ACTATTACCG AGGCCACAGC GCTCTGCACA 900 

TCGCCATTGA GAAGAGGAGW CTGCAGTGTG TGAAGCTCCT GGTGGAGAAT GGGGCCAATG 960 

45 

TGCATGCCCG GGTCTGCGGC GCTTCTTCCA GAAGGGCCAA GGGACTTGCT TTTATTTCGG 1020 

TGAGCTACCC CTCTYTTTGG CCGCTTGCAC CAAGCAGTGG GATGTGGTAA GCTACCTCCT 1080 

50 GGAGAACCCA CACCAGCCCG CCAGCCTGCA GGCACTGACT CCCAGGGCAA CACAGTCCTG 1140 

CATGCCCTAG TGATGATCTC GGACAACTCA GCTGAGAACA TTGCACTGGT GACCAGCATG 1200 

TATGATGGGC TCCTCCAAGC TGGGGCCCGC CTCTGCCCTA CCGTGCAGCT TGAGGACATC 1260 

CGCAACCTGC AGGATCTCAC GCCTCTGAAG CTGGCCGCCA AGGAGGGCAA GATCGAGATT 1320 

TTCAGGCACA TCCTGCAGCG GGAGTTTTCA GGACTGAGCC ACCTTTCCCG AAAGTTCACC 1380 

60 GAGTGGTGCT ATGGGCCTGT CCGGGTGTCG CTGTATGACC TGGCTTCTGT GGACAGCTGT 1440 
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GAGGAGAACT CAGTGCTGGA GATCATTGCC TTTCATTGCA AGAGCCCGCA CCGACACCGA 1500 

ATGGTCGTTT TGGAGCCCCT GAACAAACTG CTGCAGGCGA AATGGGATCT GCTCATCCXC 1560 

AAGTTCTTCT TAAACTTCCT GTOTAATCTG ATCTACATGT TCATCTTCAC CXSCTGTTGCX: 1620 

TACCATCAGC CTACCCTGAA GAAGCAGGCC GCXXXTTCACC TGAAAGCGGA GGTTGGAAAC 1680 

TCCATGCTGC TGACGGGCCA CATCCTTATC CTGCTAGGGG GGATCTACCT CCTCGTOGGC 1740 

CAGCTGTGGT ACTTCTGGCG GCGCCACGTG TTCATCTGGA TCTCGTTCAT AGACAGCTAC 1800 

TTTGAAATCC TCTTCCTGTT CCARGCCCTG CTCACAGTGG TGTCCCARGT GCTGTCTTTC 1860 

CTGGSCATCG AGTGC5TACCT GCCCCTGCTT GTGTCTGCGC TGGTGCTGGG CTGGCTGAAC 1920 

CTGCTTTACT ATACACGTGG CTTCCAGCAC ACAGGCATCT ACAGTGTCAT GATCCAGAAG 1980 

CCCTGGTGAG CXTGAGCCAG GANhrrTGGCG CCCCGAAGCT CCTACAGGCC CCAATGCCAC 2040 

AGAGTCAGTG CAGCCCATGG AGGGACAGGA KGACGAKGGC AACGGGGCCX: AGTACAQGGG 2100 

TATCCTGGAA GCCTCCTTGG AGCTCTTCAA ATTCACCATC GGCATGfGGCG AGCTCGOCTT 2160 

CCAGGARCAG CTGCACTTCC GCGGCATGGT GCTGCTGCTG CTGCTGGSCT ACGTGCTGCT 2220 

CACCTACATC CTGCTGCTCA ACATGCTCAT CX3CCCTCATG AGCGAGACCG TCAACAGTGT 2280 

CGCCACTGAC AGCTGGAGCA TCTGGAAGCT GCAGAAAGCC ATCTCTGTCC TGGAGATGGA 2340 

GAATGGCTAT TGGTGGTGCA GGAAGAAGCA GCXX3GCAGGT GTGATGCTGA CCGTTGGCAC 2400 

TAAGCCAGAT GGCAGCCCSG ATGAGCGCTG GTGCTTCAGG GTGGAGGAGG TX3AACTGGGC 2460 

TTCATGGGAG CAGACX3CTGC CTACGCTGTG TGAGGACCCG TCAGGGGCAG GTCTCCCTCG 2520 



AACTCTCGAG AACCCTGTCC TGGCTTCCCC TCCCAAGGAG GATGAGGATO GTGCCTCTGA 2580 

GGAAAACTAT GTGCCCGTCC AGCTCCTCCA GTCCAACTGA TGGCCCAGAT GCAGCAGGAG 2640 

GCCAGAGGAC AGAGCAGAGG ATCTTTCCAA CX:ACATCTGC TGGCTCTGGG GTCCCAGTGA 2700 

ATTCTGGTGG CAAATATATA TTTTCACTAA CTCAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

AAAAAAAAAA AAAAAAGGC 2779 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS : dotible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 
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ACXrCGCTCCG CTCCGCTCCG CTCGGCCCCG CGCCGCCCGT CAACATGATC CGCTGCGGCC 
TGGCXTTGCGA GCGCTGCCGC TQGATCCTGC CCXTICCTCCT ACTCAGCGCX: ATCCXXTTCG 
ACATCATCGC GCTGGCCGGC CGCGGCTGGT TGCAGTCTAG CGACCACGGC CAGACGTCCT 
CGCTGTGGTG GAAATGCTCC CAAGAGGGCG GCXSGCAGCGG GTCCTACGAG GAGGGCTGTC 
AGAGCCTCAT GGAGTACX3CG TGGGGTAGAG CAGCGGCTGC CATGCTCTTC TGTGC3CTTCA 
TCATCCTGGT GATCTGTTTC ATCCTCTCCT TCTTCGCCCT CTGTGGACCC CAGATGCTTG 
TCTTCCTGAG AGTGATTGGA GGTCTCCTTG CCTTGGCTGC TGTGTTCCAG ATCATCTCCC 
OXSGTAATTTA CCCCGTGAAG TACACCCAGA CXTTTCACCCT TCATGCCAAC CSTGCTGTCA 
CTTACATCTA TAACTGGGCC TACGGCTTTG GGTGGGCAGC CACGATTATC CTGATYGGCT 
GTGCCTTCTT CTTCTGCTGC CTCCCCAACT ACGAAGATGA CCTTCTQGGC AATGCCAAC3C 
CCAGOTACTT CTACACATCT GCCTAACTTG GGAATGAATG TGGGAGAAAA TCGCTGCTGC 
TGAGATGGAC TCCAGAAGAA GAAACTGTTT CTCCAGGCGA CTTTGAACCC ATTTTTrGGC 
AGTCTTCATA TTATTAAACT AGTCAAAAAT GCTAAAATAA TTTGGGAGAA AATATTTTTT 
AAGTAGTGTT ATAGTTTCAT GTTTATCTTT TATTATGTTT TGTGAAGTTG TGTCTTTTCA 
CTAATTACCT ATACTATGCC AATATTTCCT TATATCTATC CATAACATTT ATACTACATT 
TGTAAGAGAA TATGCACGTG AAACTTAACA CTTTATAAGG TAAAAATGAG GTTTCCAAGA 
TTTAATAATC TGATCAAGTT CTTGTTATTT CCAAATAGAA TGGACTCGGT CTGTTAAGGG 
CTAAGGAGAA GAGGAAGATA AGGTTAAAAG TTGTTAATGA CCAAACATTC TAAAAGAAAT 



GCAAAAAAAA AGTTTATTTT CAAGCCTTCG AACTATTTAA GGAAAGCAAA ATCATTTCCT 
AAATGCATAT CATTTGTCAG AATTTCTCAT TAATATCCTG AATCATTCAT TTCAGCTAAG 
GCTTCATGTT GACTCGATAT GTCATCTAGG AAAGTACTAT TTCATQGTCC AAACXTTCTTG 
CCATAGTTGG TAAGGCTTTC CTTTAAGTGT GAAATATTTA GATGAAATTT TCTCTTTTAA 
AGTTCTTTAT AGGGTTAGGG TGTGGGAAAA TGCTATATTA ATAAATCTGT AGTGTTTTGT 
GrrrATATGT TCAGAACCAG AGTAGACTGG ATTGAAAGAT GGACTGGGTC TAATTTATCA 
TGACTGATAG ATCTGGTTAA GTTGTGTAGT AAAGCATTAG GAGQGTCATT CTTGTCACAA 
AAGTGCCACT AAAACAGCCT CAGGAGAATA AATGACTTGC TTTTCTAAAT CTCAGGTTTA 
TCTGGGCTCT ATCATATAGA CAGGCTTCTG ATAGTTTGCA ACTGTAAGCA GAAACCTACA 
TATAGTTAAA ATCCTGGTCT TTCTTGGTAA ACAGATTTTA AATGTCTGAT ATAAAACATG 
CCACAGGAGA ATTCGGGGAT TTGAGTTTCT CTGAATAGCA TATATATGAT GCATCGGATA 
GGTCATTATG ATTTTTTACC ATTTCGACTT ACATAATGAA AACCAATTCA mTAAATAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
~li4b~ 
. 1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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CAGATTATTA TTTTGTAAGT TGTGGAAAAA GCTAATTGTA GTTTTCATTA TGAAGTTTTC 1860 
CXIAATAAACC AOGTATTCTA AAAAAAAAAA AAAAAAACTN GAGGGGGGGC CCGOTACXXA 1920 
ATT 1923 



10 (2) INFORMATION FOR SEQ ID NO: 193:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2346 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) TOPOIOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

20 AGGCTCAGGG GGACACTCTC AAAATTACAC AGCTTTTAAC AGGTGGCAGA ATTGGGGTTC 60 

AGACCCAGAT CTGGGTTCAA GTCACTCATG GTGTGATTGC GGCATTCCTT CCCGCATCTC 120 

GGCCTTGCCA TCTCTCTCTC CGAGTGGACA TGGAGAGGAC GGGGGCCCAG CAGCTGGATC 180 

25 

GCTGCAGGGG ATCAAGTCTT CTCTGGGGCT GGGCACGTAN AAGAGCATGT GGCTOGTCGA 240 

CGGCATGCCT GGCTCCTCAC CTGGCAGTCT GCCTGCCCTG CTAACCGGCT GTCTCTTCTT 300 

30 CCCCTAGTGC CCTCGGCTAG CATGACCCGC CTGATGCGWT SCCGCACAGC CTCTGOnCC 360 

AGCGTCATTC TCTGGATGGC ACCCGCAGCC GCTCCCACAC CAGCGAGGGC ACCCGAAGCC 420 

GCTCCCACAC CAGCGAGGGC ACCCGCAGCC GCTCGCACAC CAGCGAGGQG GCCCACCTGG 480 

ACATCACCCC CAACTCGGGT GCTGCTGGGA ACAGNGCCGG GCCCAAGTCC ATGQAGGTCT 540 



35 



CCTGCTAGGC GGCCTGCCCA GCTGCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC 600 



660 



40 CTCCCCGGCC CCTTTTCGCC CCCTGCCTGC CATACTGCGC CTAACTCGGT ATTAATCCAA 

AGCTTATTTT GTAAGAGTGA GCTCTGGTGG AGACAAATGA GGTCTATTAC GTGGGTGCCC 720 

TCTCCAAAGG CGGGGTGGCG GTGGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC 780 

45 

CTCTTCCATT AACCAGTGGC CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT 840 

GCCAAAAACA AGACGCGTAC AGCACACACT TCACAAAGCC AAGCCTAGGC CGCCCTGAGC 900 

50 ATCCTGGTTC AAACGGGTGC CTGGTCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT 960 

AACTGAGGAG AAGCTGATCC AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG 1020 

GGTAATAGTG ACATGCAGGC ACCTCTTTTA AACAGGCAAA ACAGGAAGGG GGAAAAGGTG 1080 

GGATTCATGT CGAGGCTAGA GGCATTTGGA ACAACAAATC TACGTAGTTA ACTTGAAGAA 1140 

ACCGATTTTT AAAGTTGGTG CATCTAGAAA GCTTTGAATG CAGAAGCAAA CAAGCTTGAT 1200 

60 TTTTCTAGCA TCCTCTTAAT GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG 1260 
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ACAAAAATAT TTCAGCAAAC GTTGGGCATC ATGCTTTTTG AAGGCTTTAG 
TGCCTCTCCT CCACAGCCCC AACCTCXCAC CCCTGATACA TGAGCCAGTG 
TTCAGGGAGA AGATCATTTA GATTrGTTTT GCATTCCTTA GAATGGAGGG 
CAGCTGCCCT GGCTGTGATG AGTGTCCTTG CAGGGGCCGG AGTAGGAGCA 
GGCGGAATTG GGGrTACTCG ATGTAAGGGA TTCCTTGTTG TTCTGTTGAG 
GTTGTGATTT CTGTGGATCC CAGCTTGGTT CCAGGAATTT TGTGTGATTG 
AGTTTTCAAT CTTCGACAGC TGGGCTGGAA CGTGAACTCA GTAGCTGAAC 
CGGTCACGTT CTTGGATCCT CAGAACTCTT TGCTCTTGTC GGGGTGGGGG 
CGTGGGGAGC GGTGGCTGAG AAAATGTAAG GATTCTGGAA TACATATTCC 
COTTCCCTCT CXTTGCTTCCT CTTTTCCTGC TCCCTAACCT TTCGCCGAAT 
CACTGAOGTT TCTGGGCGGC CAGTGCGGCT GCCAGGTTCC TGTACTACTG 
TTCATTTTGG CTCACCX3TGG ATTTTCTCAT AGGAAGTTTG GTCAGAGTGA 
GTAAGTCAGC CACTGGGACC CGAGGATTTC TGGGACCCCG CAGTTGGGAG 
TCCAGCCTTC CAGGTGGCGT GAGAGGCAAT GACTCGTTAC CTGCCGCCCA 
GGCCTTCCCT GGCCTTGAGT AGAAAAGTCG GGGATCGGGG CAAGAGAGGC 
TGGGAAACTA TTGTGCACAA GTOrTTCCAG AGGAGTTTCT TAATGAGATA 
TTTCCAGACC AATAAATTTG TAACTTTGCA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAACT CGAGGGQGGC CCOTACCCPA TTCGCCX3TAT 
ACAATC 



TTCTGCTTTC 
ATTATTCTTG 
CAACATTCCA 
CTGGGGTGGG 
ATCCAGTCCA 
GCTTAAATCC 
CTGTCTGACC 
TGGGAACTCA 
ATGGGACTTT 
GGGGCAGCAC 
CXriTCTACTT 
ATTGAATATT 
GAGGAAGTAG 
TCACCTTGGA 
TGAGTACGGA 
TTTGTATTTA 
AAAAAAAAAA 
ATGATCX3TAA 



1320 
1380 
1440 
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2040 
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2160 
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2280 
2340 
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(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3054 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
TATCTGAACC ACCCTTTATT CTACATATGA TAGGCAGCAC TGAAATATCC TAACCCCCTA 
AGCTCMAGGT GCCCTGTGGN ACGAGCAACT GGACTATAGC AGGGCTGGGC TCTGTCTTCC 
TGGTCATAGG CTCACTCTTT CCCCCAAATC TTCCTCTGGA GCTTTGCAGC CAAGGTGCTA 
AAAGGAATAG GTAGGAGACC TCTTCTATCT AATCCTTAAA AGCATAATGT TGAACATTCA 



60 
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180 
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TTCAACAGCT GATGCCCTAT AACCCCTGCC TOGATOTCTT CXTTATTAGGC TATAAGAAGT 300 

AGCAAGATCT TTACATAATT CAGAGTGGTT TCATTGOCTT CXTTACCCTCT CTAATCGCCC 360 

CTCCATTTAT TTGACTAAAG CATCACACAG TGGCACTAGC ATTATACCAA GAGTATCAGA 420 

AATACAGTGC TTTATGGCTC TAACATTACT GCCTTCAGTA TCAAGGCTGC CTGGAGAAAG 480 

GATGGCAGCC TCAGGGCTTC CTTATGTCCT CXACCACAAG AGCTCCTTGA TCAAGGTCAT 540 

CTTTTTCCCC TATCCTGTTC TTCXXTCTCCC CXSCTCCTAAT GGTACGTGGG TAOXAGGCT 600 

GGTTCTTGGG CTAGGTAGTG GQGACCAAGT TCATTACCTC CCTATCAGOT OTAGCATAGT 660 

AAACTACX3GT ACCAGTGTTA GTGGGAAGAG CTGGGTTTTC CTAGTATACC CACTGCATCC' 720 

TACTCCTACC TGGTCAACCC GCTGCTTCCA GGTATGGGAC CTGCTAAGTG TOGAATTACC 780 

TGATAAGGGA GAGGGAAATA CAAGGAGGGC CTCTGGTOTT CXrTGGCCTCA GCCAGCTCCC 840 

CACAAGCCAT AAACXZAATAA AACAAGAATA CTGAGTCAGT TTTTTATCTG GGTTCTCTTC 900 

ATTCCCACTG CACTTGGTGC TGCTTTGGCT GACTGGGAAC ACCCCATAAC TACAGAGTCT 960 

GACAGGAAGA CTCGAGACTG *TCCACTTCTA GCTCGGAACT TACTGTGTAA ATAAACTTTC 1020 

AGAACTGCTA CCATGAAGTG AAAATGCCAC ATTTTGCTTT ATAATTTCTA CXTCATGTTGG 1080 

GAAAAACTGG CTTTTTCCCA GCCCTTTCCA GGGCATAAAA CTCAACCXXTT TCGATAGCAA 1140 

GTCCCATCAG CCTATTATTT TTTTAAAGAA AACTTGCACT TGTTTTTCTT TTTACAGTrA 1200 

CTTCCTTCCT GCCCCAAAAT TATAAACTCT AAGTGTAAAA AAAAGTCTTA ACAACAGCTT 1260 

CTTGCTTGTA AAAATATGTA TTATACATCT GTATTTTTAA ATTCTGCTCC TGAAAAATGA 1320 

_ j::TGTCCCaTX-CKX:aCTCAC--^^ 4:38^- - 

CATTGCAGGC CAGTGGACAG AGGGAGAAGG GAGAACAGGG GTCGCCAACA CTTCTGTTGC 1440 

TTTCTGACTG ATCCTGAACA AGAAAGAGTA ACACTGAGGC GCTCGCTCCC ATGCACAACT 1500 

CTCCAAAACA CTTATCCTCC TGCAAGAGTG GGCTTTCCAG GGTCTTTACT GGGAAGCAGT 1560 

TAAGCCCCXrr CCTCACCXrCT TCCTTTTTTC TTTCTTTACT CCTTTGGCTT CAAAGGATTT 1620 

TGGAAAAGAA ACAATATGCT TTACACTCAT TTTCAATTTC TAAATTTGCA GGGGATACTG 1680 

AAAAATACGG CAGGTGGCCT AAGGCTGCTG TAAAGTTGAG GGGAGAGGAA ATCTTAAGAT 1740 

TACAAGATAA AAAACGAATC CCCTAAACAA AAAGAACAAT AGAACTGGTC TTCGATTTTG 1800 

CX:ACCTTTCC TGTTCATGAC AGCTACTAAC CTGGAGACAG TAACATTTCA TTAACCAAAG 1860 

AAAGTGGGTC ACCTGACCTC TGAAGAGCTG AGTACTCAGG CCACTCCAAT CACCCTACAA 1920 

GATGCTAAGG AGGTCCCAGG AAGTCCAGCT CCTTAAACTG ACGCTAGNCA ATAAACCTGG 1980 

GCAAGTGAGG CAAGAGAAAT GAGGAAGAAT CCATCTGTGA GGTGACAGGC AAGGATGAAA 2040 
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GACAAAGAAG GAAAAGAGTA TCAAAGGCAG AAAGGAGATC ATTTAGTTGG GTCTGAAAGG 2100 

AAAAGrrCTTT GCTATCCGAC ATCTACTGCT ACTACCTGTA AGCATTTTAG GTCCCAGAAT 2160 

GGAAAAAAAA ATCAGCTATT GGTAATATAA TAATGTCCTT TCCCTGGAGT CAGTTrTTTT 2220 

AAAAACTTTAA CTCTTAGTTT TTACTTCTTT AATTCTAAAA GAGAAGGGAG CTGAGGCCAT 2280 

TCCCTGTAGG AGTAAAGATA AAAGGATAGG AAAAGATTCA AAGCTCTAAT AGAGTCACAG 2340 

CTTTCCXMG TATAAAACCT AAAATTAAGA AGTACAATAA GCAGAGGTGG AAAATGATCT 2400 

AGTTCCTGAT AGCTACCCAC AGAGCAAGTG ATTTATAAAT TTGAAATCCA AACTACTTTC 2460 

TTAATATCAC TTTGGTCTCC ATTTTTCCCA GGACAGGAAA TATGTCCCCX: CCTAACTTTC 2520 

TTGCTTCAAA AATTAAAATC CAGCATCCCA AGATCATTCT ACAAGTAATT TTGCACAGAC 2580 

ATCTCCTCAC CCCAGTGCCT GTCTGGAGCT CACCX:AAGGT CACCAAACAA CTTGGTTGTG 2640 

AACCNAACTG CXriTAACCTT CTGQGGGAGG GGGATTAGCT AGACTAGGAG ACCAGAAGTG 2700 

AATGGGAAAG GGTGAGGACT TCACAATGTT GGCCTGTCAG AGCTTGATTA GAAGCCAAGA 2760 

CAGTGGCAGC AAAGGAAGAC TTGGCCCAGG AAAAACCTGT GGGTTGTGCT AATTTCTGTC 2820 

CAGAAAATAG GGTOGACAGA AGCTTGTGGG GTGCATGGAG GAATTGGGAC CTGGTTATGT 2880 

TGTTATTCTC GGACTGTGAA TTTrGGTGAT GTAAAACAGA ATATTCTGTA AACCTAATGT 2940 

CTGTATAAAT AATGAGCGTT AACACAGTAA AATATTCAAT AAGAAGTCAA AAAAAAAAAA 3000 

AAAAAACTCG AGGGGGGGCC CGGTACCCAA TTTNCCAAAT AGAGATNGTA TTAC 3054 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

GGCAGAGCTC GTGGCCGNAA CTTTTTCTGC TCCTGGCTGC CACCTACTGG CTGGCCGCGG 60 

CCCTGGCCTG GGCCTGCACC AGCCTGCGNG CGGGCTCCCA CAGCAGCCCC CTTCCAAGCA 120 

GCGTCCCCAC ACCGCGCACC TTCTGCGGGA ACGTGCTCGC CGTGCCGQGG AOCATATGGA 180 

CGGAAGGCTT TGTGCTCACC TACAAGCTGG GTGAGCAGGG TGCCAGCAGC CTGTTGATCC 240 

TCTTQGCTCC TGCTGGAGCA CGAGCGGCGT TTCTGCTCCC GAGTTGGGAC TGTGGAATGG 300 

TGTGGGTGCT GTGGTCTGCT CCATCGCTGG CTCCTCCCTG GGTGGGACCT TGCTGGCCAA 360 

GCACTGGAAA CTGCTGCCTC TGTGAGGTCG GTGCTGCGCT TCCGCCTCGG GGGCCTAQCC 420 
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TCTCAGACTG CCTTX3GTCTT CCACCTTGGA CACCCTQGGG GCCAGCATGG ACGCTGGCAC 480 

AATCTTGAGA GGGTCAGCCT TGCTGAGCCT ATGTCTGCAG CACTTCTTGG GAPGCCTGGT 540 

5 

CACCACAGTC ACCTTCACTG GGAATGATGC GCTGCAGCCA GCTGGCXXXX: AGGGCXTTIGC 600 

AGGCCACACA CTACAGCCTT CTGGCCACGC TGGAGCTGCT GGGGAAGCTG CTGCTGGGCA 660 

10 CTYTGGSCGG AGGGCCTGGC TGATGGGTTG GGGCXACATC CCTGCTTCTT GCTCCTGCTC 720 

ATCCTCTCTG CCTTTCCCGT TCTGTACXrTG GACCTAGCAC CCAGCACCTT TCTCTGAGCT 780 

GAC5TQGCTGG AGTGGTCAAT AAAGCCACAT GTGCCTGTGG CCCAAAAAAA AAAAAAAAAA 840 

AAAAAAAAAA AAAAAAACTG GAGGGGGGGC CCGGTACCCA AATCGCCGGA TATCATCGTA 900 

AACAATC 907 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 196: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEnNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 
GGCACGAGGA GGGACAGGGA GTGGGCAAGG GGAAGAAGCA GCTTATTTGA CTAACCAGCC 60 
35 CCTCTGTGGT CCACCAGCGT CTTGGCITGG TGGGAGGGCT CTCAATCAGC AGGGCCCCAG 120 

KAGGGCAAGA-AGAAGTGGGG-CAAAGCCTGG-CGCTCGGCOT 180-- 

TCTGGAGCCA CGCCTCCTCC AGGCCATGCT CCTTGAACTT GGAAATGTCA ACCGGAGCCC 240 

40 

TTAACACCAG CCCTCCAGCA TCTAATAGAC TTGAATCTAC TCTAAACGAA TATTTAATCC 300 
AACCTCAACT ACATTGTAGC TCAGTCCAAC GACTAACCCT GAAATGGGGG TGTTCCAGCC 360 
45 TTCAGCGAGA TGGCCAAGCG GTCCCCTGGG GGCTGTGGCA GCGGGCTTAT CCTTCTCTGT 420 
TGCCAACCTT GCCGTCCGAC CTCCTCCGCC CCCATGCGGT GACCCCGTCC GTGTCTGTGT 480 
CTGTCCATAC GTGTGAGTCC AGCTAAAAAG ACAAAACAGA ACCCGTGGGC CCAGCTCGGA 540 

50 

AGGTGCGTGG AGAAGGCTCC GACGTCTCCG AAGTGCAGCC CTTGGGATGG CATTCCGTTG 600 
TGTGCCTTAT TCCTGGAGAA TCTGTATACG GCTCGCCTAT AAGAAATATA GCCTCTTCAT 660 
55 GCTGTATTAA AAGGACTTTT AAAAGCAAAA AAAAAAAAAA AAAAACTCGA GGGGGGGCCC 720 
GGTACCCAAT TCGCCCAATA GTGAGTCGTA TTACAATTCA CTGGGCCGTC STTTTAACAA 780 
CGTCGTGAAC TGGGAAAACC CTGGCGTTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC 840 



60 



ISOOCID: <WO_9839448A2_l_> 
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CTTTCGCCAG CTGGCGTTAA TAGCGAAAAA NGCCCGCACC CGAATCGCCX: TTCCCAACAG 



900 



TTTGCGCAGC CCTGAATCGC GAAATGGCAA ATTGTAAGCG TTTAATATTT TKKnTAAAAT 



960 



TCCNCGTTWA AWTTTTTGTT TAAATCARCT CAATTTTTTT AACCCAATAA GSCCGAAATC 



1020 



CXSGCAAATCC CCYTTATTAA TTCCAAAAAA ATAAACCSAA AAWQGGTTTG AATTTTTTKT 



1080 



TTCCCCAYTT TTGGAAACAA AWrYCCCCCT TTTTAAAAAA GTTGGAACCX: CCA^ECCYTCC 



1140 



AAAGGGGAAA AAACSYTTTT YTGGGGGGNA ANGGGGCCCC COTACTTTNA ACAYCCCCCC 



1200 



CXZAAWCAATT TTTTTGQGGG GTCCCNAAAG GTCCCCCTAA AANCTTTTTT CGGAACCCNA 



1260 



AGGGGANCXX: CCCATTTAAA ATTTTNGGTN 



1290 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : douOble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

GGTGTGCCTG GATGGTCGTG TAGGTGAGTT TTACCAAGGA TTATGGTAAC AAATGAGTGA 60 

GACCTCTATG GAGAAAATAT TGAAGNNCAT TAAAGAAGAC CTCATANTAG GAGAGAATGT 120 

SCTTTGGAGG ATTTGTATTG AGCTTTTACA GTATTCATTT TTCAACTCAA GGCAATGGCT 180 

TTCTACACCA ACTCTAATCC ATAAACQGGT CTTATGACAT CTATGAAGTA GTAGCAAGAC 24 0_ 

ATGCTTAGTG TGTATTTCTC TCTTTGAGAC ACTGTAATTT CTACCAGAAA TTTCCAGAGC 300 

ATTATGTAGG TAGAAAAAAA TGCAAGCAAG CTGTTAAAGA TCTTGGATCC CATTATATAG 360 

TATGTATAGC TGAAATCTGT AATTCAATCA CTTTTTCTCT TTTATCCTCT AACCAAAAAA 420 

TTGTTTAATT TTGCATCCCA AATGTTTTTA ATCTTTGTAT ATTTTTTAAA AAYCCTTTTC 480 

TCCTCATCAT TGCCTTTTTT GTGGTTGTAA ATAGACTTAC TTGCACTTTG AAGATGAGTT 540 

ACTCCTTGTC ATCTTACAAA TATGTGATAT GGTAATTTTC ATAACAGATG TCAGTTTTGA 600 

ACCAAGAATT GGTGATTTGT TTATAAGAAA AAAACTGGCT TCATTTCTGT GAAATTGCTC 660 

TTTGAAAATT TCTTTTTACA CGTGTAAGCC AACTGAGATA CCGTGATGGT GTTGATTTCT 720 

TTCAATGATG CTTACCATCT ATTTTAQCCA CTGAGCCTTT TATTATTTGT CTATTTGTAA 780 

AGTTTATTTG TCTTAACTCA TTTAATAAAT ATACTGTTTA TCTGTTTCTG AATGGGGACT 840 

GAACTTTTTG GATATTGATA TTGATTTGAA AATATTTrGG AATTTTTTCT ACTTGAAATT 900 

TTAGAAATCT AATKGAAAAT TCTATAATGT ACTGAAAGTA WGGTTGTGTA CAGTGAKCAC 960 
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TCTCTAATAA TATGATGNCT TGCXXTTAAAN GAGGNGGGAC ATGTCCCACT TTCCACCACG 1020 



,(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENC3TH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: doiible 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 198: 

AATTCCCGAA GCTGAGQGTT GTGTGCCNTC GGGOGAGCCA AGTCTTTTGA CCGGACCCTT 60 

CCCGGCGCAG AAGANCTGAA GTrGATTTGA GAGCCTGTKT TTGGGGTTRA GCCGAGCTGC 120 

20 

TGCGQGCTTY GTCGCCGGCC AGGACACAAG YTACTTGCAA CGGGGCGGCG CCTGGCITAT 180 

GATGTTCCTC AACCCAGGGG CGGCCTCTGC CCTCTACTCG TGCCAGGCCC ACTTGCCAGG 240 

25 CAGGAGCCCT CCCCAAGCCT TCAGGGCTGC TCGGAGTCAC CTGTTGGAAT GGACTAAAAG 300 

GACCCTTGTG TGGGAACAGG TGCTCCAAAC ACCCTGCTGC TGGCTGCCAG GCAGGCCCTC 360 

TGGAAGGGAA GGGGCAGGAC TCATCAGGAC CTCCCTGGAC CCTGCAGGGC AGGCAGTTQG 420 

CCCGAGCCCA AGCATTTGGC TCTGCTTGCC CCAAGGGGAC AGGAAGCCTC TTGGGCCTCT 480 

TCCCTTCCTG GACAAGGCCC CCTGCCTTTG CCTCACATAA ACTG 524 



30 



35 



55 



(2) INFORMATION FOR SEQ ID NO: 199: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
GTGATACAAG GAAGGGTGAT CATCATCTGT CACCATGCAA ITCCTGCTCA CAGCCTITCT 60 
50 GTTGGTGCCA CTTCTGGCTC TTTGTGATGT CCCCATATCC CTAGGCTTCT CCCCCTCCTA 120 
GAAGGGCTTC TTGATAGATT AGAAAATAAG AATGAGTGAC ATTTCCTATG TGCATATAAG 180 
AAGGAGCCAC AAGACATGTC TTTTAAATAA AAGGACAGTG TCCATCCTTT TAGCTGCCGA 240 
ATAGAACCTT GGTCTCATCC TCCTGGAGCT AGGSCTTAAA ACAGCTTCTG TGTTTCTSAT 300 
TKGTPCTCART GTTTTGCCAA GGTTTTATTC GG 332 



60 



SOOaO: <WO 9e3944aA2 I > 
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(2) INFX)RMATIC»^ FOR SEQ ID NO: 200: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENC?ni: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



CCAGGGAAGC CCCARGCCTG TCCTGAATTG ACATCAGTGC TTCCCTGAAC TGCCTCCCCC 60 

15 ACCCCTGGGC ATTATCCCAG GAAACTTATG TTTTCTAGAA GCTAAGCAGC TQCTGGGACT ' 120 

CAGGGACTGG TGCAGGTAGG CTGAGTGGCA GCTCAGTCCT AGAAGGTCTC TGAAGATCTG 180 

GACTGAGGAC CYTGCTACTC CCCAAGCCAG AGCCCATCAG CCAGGCCTGC TCTTGAGCCAC 240 

20 

CTGCCTGTGG AGTGCTGAGC TCAACCAAAG GCTGGCAAGC TCTGGGCCTC ATTTAAGGGA 300 

TTCTGATGAG CCGATGQGCC CTGGAGGCAG CCCATTAAAG CATCTGGCTC GTTTTTGGAA 360 

25 AAAAAAAAAA AAAAAG 376 



30 (2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



40 CCCAGTATAT TTCTATAACA TTTATTTrAG TGAACTTATA ATGTTTCTTT GTATTAAATT 60 

ATTAGATTAT ATCTTTAGAT AATATrGTTA CTNAATTAGT AGGTAATATA TATTTTATTC 120 

AAAAATAAAT TGTGCATCTA ATGTCTACCA ATTAATGTAC TTCTAGATGT ATCTTATCTT 180 

45 

AACTTGAGTC TTTGCTGCCC CTAATGAGGT GTGAAGGACT CTTCTCCCCT GGGGAAGTTT 240 

TTCTTTTTCA GGAGGGAGGA GGGCTTTCCC AGGTAATGTG TCTAGAGrTGT TGGGCAGAAR 300 

50 AATCTGGGAC CACACCACAC CAGTTCTCTC CTTAATCCAC GTCATTTGCC TTCTATCCCA 360 

GCTATGTTTC CAGTGTCCTC TQGGTGTTTC CAAGAGCAAC AAGAAAYGAA TAAATCTCTG 420 

KTGAGTTGTT TATTTGTTCT TCACTTTGTT TTACACTGTA WTTTCTGAGT TTATGGGTGT 480 

55 

CTGTGAATTA AAAAGGAAAA GTRGAAATAA GTAAAACTCA GGTTGAAGGA AATATACATA 540 

AATAAGATAA AGCTGACCTG TAGATATARR CAGGTTATAA RAGCTTAGAG TTGTCTAAGT 600 

60 TGRGTGCAAA KTTTCCTCTG ATCTTTCTGA TGCCGARACA AAAAAGGCAG TCATGTTTGT 660 
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WATGTGATTG GAATQGAACC CGARAAGAGA GCAYGCTGTG TTCTTGGGGA CAGGAAAGCT 720 

TGYGTGCACC AAGTCTKAAC CACCACCTTC ATGGGACATA GRTTATGTGC TGGAACATAT 780 

5 

TTCACACCGG CCTGGCAGTA AACACTTGTA GTGTTGTGCA GTGGAAACGG TCATCTTCCG 840 

CTAAAGCACG GCGTGTTGTG CAGCGGAAAT GGTCATCTGC TGCTAAAACA CAGCTTCCAT 900 

10 CGTAATGTAT GCTCCTTACT CAAAGAGTGT GGTCCCAAAC AGCCTTTGGG AGGTCCTCCT 960 

TGATTCATGG ATGAAACCTG GAACATCTTG AGGACTGAGT TAACCATAGG TCCTTAAATA 1020 

ACTCTCCACA CGTTTTTCTT AGTTTATCTC TACATGCAGG GTGTGCAGCA GCXTTCTTCAA 1080 

15 

AGTCATATTT TCTGGGAAAT ATTTCCAGTG TTTATTTGCA CTTTAGCCCA CTCTGTGTAG 1140 

CCTTATTTCT TCTAAACTCA CCATTAATCT GAATAATAGT CAAATTTAGG GG 1192 

20 

(2) INFORMATION FOR SEQ ID NO: 202: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
ATCTTGGGCT ATCTTTGACA GGGGATTCTT GCAAGTTGAT GCTTTCTACA AGTGAATATA 60 
35 GTCAGTCCCC AAAGATGGAG AGCTTGAGTT CTCACAGAAT TGATGAAGAT GGAGAAAACA 120 

GAGAGATTGA-GGATAGGGAA-GGeATGTeTe~eAGTTeTeAA--TTeTAAAl^ IBO" 

AAAATGATAG TATCCTGATG AATCCAGCAC AGGATGGTGA AGTACAACTG AGTCAGAATG 240 

40 

ATGACAAAAC AAAGGGAGAT GATACAGACA CCMGGGATGA CATTAGTATT TTAGCCACTG 300 
GTTGCAAGGG CAGAGAAGAA ACGGTAGCAG AAGATGTTTG TATTGATCTC ACTTGTGATT 360 
45 CGGGGAGTCA GGCAGTTCCG TCACCAGCTA CTCGATCTGA GGCACTTTCT AGTGTGTTAG 420 
ATCAGGAGGA AGCTATGGAA ATTAAAGAAC ACCATCCAGA GGAGGGGTCT TCAGGGTCTG 480 
AGGTGGAAGA AATCCCTGAG ACACCTTGTG AAAGTCAAGG AGAGGAACTC AAAGAAGAAA 540 
ATATGGAGAG TGTTCCGTTG CACCTTTCTC TGACTGAAAC TCAGTCCCA 589 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 203: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTO: 847 base pairs 
60 (B) TYPE: nucleic acid 



SOOCID: <WO_9839448A3J_> 
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(C) STRANDEODNESS : double 

(D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

GGCACGAGCG CAAGCTGCTG GCCGCCATCA ACGCGTTCCG CCAGGTGCGG CTGAAACACC 60 

GGAAGCTCCG GGAACAAGTG AACTCCATGG TGGACATCTC CAAGATGCAC ATGATCCTOT 120 

10 ATGACCTGCA GCAGAATCTG AGCAGCTCAC ACCGGGCCCT GGAGAAACAG ATTGACACGC 180 

TGGCGGGGAA GCTGGATGCC CTGACTGAGC TGCTTAGCAC TX5CCCTGGGG CCGAGCAGCT 240 

TCCAGAACCC AGCCAGCAGT CCAAGTAGCT GGACCCACGA GGAGGAACCA GGCTACTTTC 300 

CCCAGTACTG AGTQGTGGAC ATCGTCTCTG CCACTCCTGA CCAGCCTGAA CAAAOCACCT 360 

CAAGTGCAAG GACCAAAGGG GGCCTGGCTT GGATCGGTTG GCTTGCTGAT GGCTGCTGGA 420 

20 GGGGACGCTG GCTAAAGTGG GGAGGCCTTG GCCCACCTGA GGCCCCAGGT GGGAACATOG 480 

TCACCCCCAC TCTGCATACC CTCATCAAAA ACACTCTCAC TATGCTGCTA TGGACGACCT 540 

CCAGCTCTCA GTTACAAGTG CAGGCGACTG GAGGCAGGAC TCTTGGGTCC CTGGGAAAGA 600 

25 

GGGTACTAGG GGCCCGGATC CAGGATTCTG GGAGGCTTCA GTTACCGCTG GCCGAGCTGA 660 

AGAACTGGGT ATGAGGCTGG GGCGGGGCTG GAGGTGGCGC CCCCTGGTGG GACAACAAAG 720 

30 AGGACACCAT TTTTCCAGAG CTGCAGAC5AG CACCTGGTGG GGAGGAAGAA GTGTAACTCA 780 

CCAGCCTCTG CTCTTATCTT TGTAATAAAT GTTAAAGCCA GAAAAAAAAA AAAAAAAAAA 840 

AAAAAAA 847 

35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 204: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOIOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

ACAAACATAC TCGCAGGAAG GAGTCTCATG CTGCCCGCAG CATCAGCGCA ACNCNTGGCC 60 

GCCATCAACG CGTTCCGCCA GGTGCGGCTG AAACACCGGA AGCTCCGGGA ACAAGTCAAC 120 

TCCATGGTGG ACATCTCCAA GATGCACATG ATCCTGTATG ACCTGCAGCA GAATCTGAGC 180 

55 AGCTCACACC GGGCCCTGGA GAAACAGATT GACACGCTGG CGGGGAAGCT GGATGCCCTG 240 

ACTGAGCTGC TTAGCACTGC CCTGGGGCCG AGGCAGCTTC CAGAACCCAG CCAGCAGTCC 300 



AAGTAGCTGG ACCCACGNAG GAGGAACCAG GCTACTTTCC CCAGTACTGA GGTGGTGGAC 360 

60 
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ATNCGTCTCT TGCCACTCCN TGNACCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC 420 

AAAGGGGGCC CTGGCTTGGA GTGGGTTOGC TTGCTGATGG CTGCTGGAGG GGACX3CTGGC 480 

> TAAAGTTGGGK AGGCCTTQGC CCACCTGAGG CCCCAGGTGG GAACATGGTC ACCXXrCACTC 540 

TGCATACCCT CATCAAAAAC ACTCTCACTA TGCTGCTATG GACX3ACCTCC AGCTCTCAGT 600 

TACAAGTCCA GGCX3ACTC3GA GGCAGGACTC CTGGGTCCCT GGGAAAGAGG GTACTAGGGG 660 

CCTGGAICCA GGATTCTGGG AGGCTTCAGT TACCGCTGGC CGAGCTGAAG AACTGGGTAT 720 

GAGGCTGGGG CGGGGCYGGA GGTGGCGCCC CCTGGTGGGA CAACAAAGAG GACACCATTT 780 

15 TTCCAGAGCT GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCCTCTGCT 840 

CTTATCTTTG TA B52 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUOGYi linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

GATTCGGCAC GAGGCTTGCT GGAGCAGGAG AAGTCTCTRG CCGGCTGGGC ACTGGTGCTG 60 
GCASGARCTG GCATTGGACT CATQGTGCTG CATGCAGAGA TGCTGTGGTT CGGGGGGTGC 120 

35 

TCGGCTGTCA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 180 



CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT CGTYTGCCTG 240 

40 TGCACTGGAG TCATQGGTGT CTGCTGCACA GCCCTGCTGG TGGCCGTGGT GGCCCGGAAG 300 

CTGGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTATACC 360 

AAAGAGATGA AGGAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 420 

45 

ACTCQCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAANCTGCT QGCCGCCATC 480 

AACGCGTTCC GCCAGGTGCG GCTGAAACAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 540 

50 GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 600 

CACCGGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATGC CCTGACTGAG 660 

CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAQCCAGCA GTCCAAGTAG 720 

55 

CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 780 

CTGCCACTCC TGANCCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC AAAGGGGGCC 840 

60 CTGGCTTGGA GTGGGTTGGC TTGCTGATGG CTGCTGGAGG GGAOGCTGGC TAAAGTOGGK 900 



SDOaCh <WO_9839448A2J_> 
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AGGCCTTGGC CCACCTGAGG CCCCAGGTGG GAACATGGTC ACCCCCACTC TGCATACCCT 960 

^ CATCAAAAAC ACTCTCACTA TGCTGCTATG GACGACXTTCC AGCTCTCAGT TACAAGTOCA 1020 

GGCGACTGGA GGCAGGACTC YTGQGTCCCT GGGAAAGAGG GYACTAGGGG CCCGGATCCA 1080 

GGATTCTGGG AGGCTTCAGT TACCGCTGGC CGAGCTGAAG AACTGGGTAT GAGGCTOGGG 1140 

10 CGCX3GCTGGA GGTGGCGCCC CXrTGGTGGGA CAACAAAGAG GACACCATTT TTCCAGAGCT 1200 

GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCX^CTGCT CTTATCTTTG 1260 

TAATAAATGT TAAAGCCAGA AAAAAATAAA AAAAAAAAAA AAAAAACTCG AGGGGGGCCX: 1320 

15 

AGACCCAATC TCXCTATAGT AAGNCGCCNN ANAN 1354 



20 



30 



40 



<2) INFX3RMATION FOR SEQ ID NO: 206: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEEWESS: double 
<D) TOPOUOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TCCCCAGGTG CACAGCCAGG GCCCTCCTGT CTGCAQGAGA ATTCACAGCT C3GTGTGGGAC 60 

TCAGCCCCTA GNCCATTCAA AGCCTTAATG TTGTAATCAT ATCTTACGTG TTGAAGACCT 120 

35 GACTGGAGAA ACAAAATGTG CAATAACGYG AATTTTATCT TAGAGATCTG TGCAGCCTAT 180 



TTCTGTCACA AAAGTTATAT TGTCTAATAA GAGAAGTCTT AATGGCCTCT GTGAATAATG 240 

TAACTCCAGT TACACGGTGA CTTTTAATAG CATACAGTGA TTTGATGAAA GGACGTCAAA 300 

CAATGTGGCG ATGTCGTGGA AAGTTATCTT TCCCGCTCTT TGCTGTQGTC ATTGTGTCTT 360 

GCAGAAAGGA TGGCCCTGAT GCAGCAGCAG CGCCAGCTGT ANATAAAAAA TAATTCACAC 420 

45 TATCAGACTA GCAAGGCACT AGAACTGGAA AAGACCACAG AAAACAAAGA ATCCAACCCT 480 

TTCATCTTAC AGGTGAACAA ACTGTGATGA TGCACATGTA TGTGnTTGT AAGCTGTGAG 540 

CACCGTAACA AAATGTAAAT TTGCCATTAT TAGGAAGTGC TGGTGGCAGT GAAGAAGCAC 600 

50 

CCAGGCCACT TGACTCCCAG TCTGGTGCCC TGTCTACACC AGACAACACA GGAGCTQGGT 660 

CAGATTCCCC TCAGCTGCTT AACAAAGTTC CTCGAACAGA AAGTGCTTAC AT^GCTGCCT 720 

55 TCTCGGATAC TGAAAGGTCG AGTTTTCTGA ACTGCACTGA TTTTATTGCA GTTGAAAAAA 780 

AAAAAAAGCT ATTCCAAAGA TTTCAAGCTG TTCTGAGACA TCTTCTGATG GCTTTACTTC 840 

CTGAGAGGCA ATGTTTTTAC TTTATGCATA ATTCATTGTT GCCAAC3GAAT AAAGTGAAGA 900 

60 
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AACAGCACCT TTTAATATAT AGGTCTCTCT 
GTGACAATTT TCATATTCTC ATTCTTAAAA 
5 TGAGAATAAG TTACACACAA TGGCCACAGC 
TCATGTAATC GGTTACTCAC TACTGCCTTT 
CATGAGACAG GATTATAGTG CCTTAACCGA 

10 

AACTGCTCTT CTGCTCTAGT ACCATQCTTA 
TGCTTGTTCT TATTTTAATA AATTTATCAG 
15 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



435 

GGAAGAGACC TAAATTAGAA AGAGAAAACT 960 

AACACTAATC TTAACTAACA AAAGTTCTTT 1020 

AGTTTGTCTT TAATAGTATA GTGCCTATAC 1080 

AAAAAAAAAA ACCAGCATAT TTATTGAAAA 1140 

TATATTTTGT GACTTAAAAA ATACATTTAA 1200 

GTGCAAATGA TTATTTCTAT GTACAACTGA 1260 

AGTGAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAGAA NAAAANAA 1378 



20 (2) INFORMATION FOR SEQ ID NO: 207: 

(i) BEQUEUCE CHARACTERISTICS: 

<A) liElOGTH: 1166 base pairs 
(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

30 AANCCACTGC AOTTTAAACC CCCTCCCCTC CAAGAAAGTT CACAACCGGC CATGGATGAC 60 

CCTCATTTTA GATGGGCCNC AATATTTAAG ATGGACTGRG GMCCCCARAG ACTGACCCTT 120 

GAAAGGGGGA CTCAGAAGAA AGATCCTTGA CATTGCCMAA CATGCTGGGC TTGTCCAACA 180 

35 

CAGTGATGCG GCTCATCGAG AARCGGGCTT TCCMAGGACA AGTACTTTAT GATAGGTGGG 240 

ATC3CTGCTGA CCTGTGTGGT CATGTTCCTC GTGGTGCAGT ACCTGACATG AGCCAGCCAC 300 

40 GCTCAGTGGC TGAACAGCAT TCCCACAGCC TGCAAGTGTG TGTGTGTGTG AAAGAGAGAG 360 

GGGGCCCAGA GGCCGCCTTT TGAAATGTTT GCCTGTCTGA ACTGTGAAGA CACTTGGGAG 420 

TGATTGTGGT CTAATTTCCA ACCTGCTCTG TTTTCTGTGA CATCTTQGAG GGGGAGCTAG 480 

45 

TGCCAMCACC ATGCGCGGTG CTTAGGAAAT GAAAGAAGTC CCGGGTCTGT CTCTCTCACT 540 

CTCGCTCTCA MTGGGGGAGG GAAAGAATGG CTTTGGTGGC TTTGTTCACA CAGCTGATGC 600 

50 GTGSCCTGGG AAGGTGTCCA GAGTGAGCCC TGTGTGCAGG ACTGTCCACN ACGGTTCACA 660 

CCTTGTCACC ATCAGGCCTT TCTGGCTCCT GATAGGGTGG AGCAAAAGTG GAAAGGAAAG 720 

GAAAGAGGCY TTTTCTCACA GCCATTATAT TAAATAGTAG GTCGATTCAC ATCYTCGTCC 780 

55 

TCCTGGCCAC CCTCCCCTGT GCCTCAGTGA CATGTAGATG ACTGACTGCC AATACTTC3TC 840 

ACCATTCCCT GGAAGCAGCT ACCTAGGGGA AACAAGATGT AGTGCTATTG CCGATAACAA 900 

60 GTAAGATTTT CCACACTACA GCTGGGTGTT TCTCTTTTCT AAAGTGAGGC CAGTGTTATT 960 
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TCCCGGGAGT GTTCAGTCTT GACCCTAGTC ACTGATTTTT TCTAGTTCTT AATAGAGTGG 1020 

TTGGGCTTTT AAGGTTCAGA GACTGTGGGC TTGGGCACTT GCGCCCAGGG SmTGTGGG 1080 

5 

GGCCTTTGCX: CXTTAGRAAA GTAGCnTTA GGGGCAAAGA TTTGTTGAIT TTCCXTCATTA 1140 

Cy^GTCTTCAG CTCNAGGGTT TTAAAA 1166 

10 



(2) INFORMATION FOR SEQ ID NO: 208: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LEIKSTH: 697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANZDEONESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
TACTTCTAGG ATTATAAGGA ATTAACATTG AGATGACATT TCCATTTGAG AAGGAAAATA 60 
25 GTTGCTTTCA GTGCCTTTTA TTTGATTCCT GGAGAGAGCA GACTCGCACS AACATTCAAC 120 
CCCAGCGCTG ATATGACAGT AATCCTCAGA GGCAGAGCCC AGCACAAAAC AGCAATGCTA ISO 
GAAAGTTACA ATTGGAAAGT TTCCTGCCAG CTTCGGGAAT GACACTGCAA AGCTGATGCC 240 

30 

AGAAACTGCC AGRGT7ATTC TCCTCATTAC TGCTCTACCC ACCCACTTTC AGCTCCCCAA 300 
ATTAACTAGT GCAGTTGACT AATTCTCTTT ACCTTTATCA TTTARGGTGA RGCATTGCAC 360 
35 AAAAACTCTC GACTTTGCCA TATAAGGGCT GTGGTTCTCT GTGGTCCCCT GGATAAGAGG 420 



CATCACCATT ATCTGGAAAC ATGCAGTAAA TGCAGATTNT TCATCTTCTC CCCAGACCTC 480 

CTGAGTTAGA AATTCACAAG TTCTCCAGGT GATCTCATAC ATGCTAAAGT TTGAGAACCA 540 

40 

TTGAGTAAAG TTAATGCATT AAGAAGAGAT TAGATAGGGA TGGTGGCGTA TCTTCCTACA 600 

GTTTCCCTGT TAACAAGAAA GTCAGAGGTC AGTTGATCAG ACATTAGATT ATTTATTGCT 660 

45 AAAACTAAAA AAAATTAAAA AAAACTGGAG QGGGGCC 697 



50 (2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
60 CGTGAGTCAC CTCTCTATAG TGGGCGTGGC CGAGGCOGGG GTGACCCTGC CGAAGCCTCC 60 
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GCTGccAGAA acx:atgttca AGGTAATTAA AAGGTCCGTG GGGCCAGCCA GCCTGAGCTT 120 

GCTCACCTTC AAAGTCTATG CAGCACCAAA AAAGGACTCA CCTCCCAAAA ATTCCGTGAA 180 

5 

GGTTGATGAG CTTTCACTCT ACTCAGTTCC TGAGGGTCAA TCGAAGTATG TGGAGGAGGC 240 

AAGGAGCCAG CTTGAAGAAA GCATCTCACA GCTCCGACAC TATTGCX3AGC CATACACAAC 300 

10 CTGGTCTCAG GAAACGTACT CXXZAAACTAA GCCCAAGATG CAAAGTTTGG TTCAATGGGG 360 

GTTAGACAGC TATGACTATC TCCAAAATGC ACCTCCTC3GA TTTTTTOCGA GACTTGGTGT 420 

TATTGGTTTT GCTGGCCTTA TTGGACTCCT TTTGGCTAGA GGTTCAAAAA TAAAGAAGCT 480 

AGTGTATCCG CCTGGTTTCA TGGGATTAGC TGCCTCCCTC TATTATCCAC AACAAGCXM' 540 

CGTGTTTGCC CAGGTCAGTG GQGAGAGATT ATATGACTGG GGTrrAOGAC5 GATATATAGT 600 

20 CATAGAAGAT TTOTGGAAGG AGAACTTTCA AAAGCCAGGA AATGTGAAGA ATTCACCTGG 660 

AACTAAGTAG AAAACTYCAT GYTCTGCCAT CTTAATCAGT TATRGGTAAA CATTGGAAAC 720 

TCCATAGAAT AAATCAGTAT TTCTACAGAA AAATGGCATA GAAGTCAGTA TTGAATGTAT 780 

25 

TAAATTGGCT TTCTTCTTCA GGAAAAACTA GACCAGACCT CTGTTATCTT CTGTGAAATC 840 

ATCCTACAAG CAAACTAACC TGGAATCCCT TCACCTAGAG ATAATGTACA AGCCTTAGAA 900 

30 CTCCTCATTC TCATGTTGCT ATTTATGTAC CT 932 



35 (2) INFORMATION FOR SEQ ID NO: 210: 

. - _ _ SEQUENCE-CHARACTERISTICS: - — 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

45 GTCATTCTTT AAATAAAAGC nTCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA 60 

AGATTCCCCC TAGGGTTGAT ATGTGTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA 120 

TTTTAAAGCT TTGGCTATAT AGTCAGAAAT GTCCTAAATA T^CAAACTATT TTGTATTTAA 180 

50 

TTTAQGGAAG ACTAAAGGGA AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT 240 

ATTAGGGCTT AAAGGGCTTT TCrAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT 300 

55 TCCTGTTTTT GAGATGAACA GATCTCTGGG GAAATTGTTG AGTTACAATG GCATTTCACT 360 

GTGATCCCTC TCAAGCTCAG ATCAGTTCTA TAACCCAATG ACAACCTGTC TCTTTGGTTT 420 

ACTGTCCTGT GAAATGTCAG CTCAAGTTTC CCAGAAGTCG TGTGTTTATG ATGAGTCAGA 480 

60 
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GTGCTTTTCC TCGGTGGGAC AGTTGCTGGC CCTCTTAATT TrGGTGTATG TCCTTCCAAG 540 

TATCTAAACX: TCCAGTCTGA TCTGTATATG CTATCCTAAC TGTTAATTGT ATTATTGATT 600 

5 ATGTTGATTA TCTTGCTTGA AGGTTCATAC TTTTCAATTT GATAGAAATA AAGTmTTT 660 

C 661 

10 

(2) INFORMATION FXDR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 

(D) TOPOLOGY: lineeir 

20 (xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 211: 

GAAACTGACA TTGTTAAACA CACTAAAACA GAAGTACTTA CCTCTTGAAG ATTTAATATA 60 

TAATGGTTGA CATGATACAT GTACATGAAT GGAATGACCA GATGCTTATG GTCTACATTT 120 

25 

TCCTTTATCC TGTTAGTATT ACCTTCCTTA ATCTTTGTTC CTTAACATGC TAAATTCCTC 180 

TTCAGTGTTT ATTTTCTAGT GACAGAATGC TAACATTTCT TACACCCTGG CAGAAGGGAG 240 

30 AGAAATGTGT TTTGGGGTGG GTAACTAAAT TTTTGAGTGA AATATCATAA GATGAGAATG 300 

GAAAGAGGGA GACACAAAGA GTTATAACAA AAAAACAATG GTTTTTTTAG CCATTTGACT 360 

GGCTCTTTAA ATAGTCTACA AGACATTCAC GTTNAACATC ACTTTTAGTG AAATAAAATG 420 

35 

TGCCATACTA GTATGTGCTT CAAAAGGGCA AATGTGCTT T AGTGCCCTAA GGCTAAATTT 480 

TGGTCATTTG ACATCAGAGA TGTTGTAAGT ATTGCACTTA ATACGCACCT ATTTCTCAAT 540 

40 AGTGNTATTT TTTTGGCTAG CATTTNCTTT ACCACTAACC TTGTTGGATA GC 592 



45 (2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 
55 TGGAGTGGCT TTCCAGCTGA ATGAATCCTA TGTCTCGCGT GCAGGTGGTT GGTTTTCAAT 60 
GTTCTTSCTA ATTTTTTTCC TATTGGCTCT TGGGAGTTTN CTrTGTTTGC TCCTGTGTTT 120 
GCCCAGCTTT AATAAAACCA GGCGCAAACA AAAACCATAG CATTCTGAAA CAATAGGGGG 180 

60 
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CCCACATTGG ACCCAGTATG TCACTTTAAT GGACTTCAAG AAAAAATCTG AATGGGAAAA 240 

TGACACTAGG AATGTATACT CCACACATTT TATGCCATAT AATGGTGTOT TTTCTTAArr 300 

TTGTTTCTTG TGGCGAAATG TGGCTTTCAA ATTAAAATCM CCTTTTCTTC TTKGAAACTT 360 

TTTGTTTKGA CTKGTATAAT TAAGGGTTTG GAAAGATTCA TAATTMrGAG AGAGGTTTGC 420 

AACCAGGAGA TACAAAGAAG TCTCAGTAGT AATCTTGTTC ATGTGCTTTT ACAGCCAC5CT 480 

ACATTTAAGR ATGTATTAGT TACAGAAATT ATATGTCTGT GTATGTGTCT CTACTCAATA 540 

AAGTACATGC CTCCACATAA TQCGGTGCTG TCCATCTCGG CAAATACTGG CX:AAGTCCCT 600 

15 TTATGACAGG CACACAGAAA CCATAGCATG GTCTGGCTTT CAGAAAATGC CTCTCATCTT ' 660 

TCCTGGAACC TTATTTTGCT AAATGTCTGT TTTCTTGTGA TTTGTTGTAC CTCACAGCAC 720 

CATTGTGACC ATGGTGATGC CTCATTTGCA TGATATGTAC CTTGTGTTTA ATGTGAAATA 780 

20 

CATTTTCATT GAAGAGTCTG ATGACTTGCT AGCXnTTTAT TTTTTCTGTA AGCTCAATGT 840 

GCTGAAACCA AACCAGGCTT TTAAAAACCT GTGTAGAAGA AAACCAAAAA ATCCTGTOTG 900 

25 GGTGTCCTTT CCCTGTCAAA CTCATTAAAA ATTCCTTT 938 



30 (2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

40 AGCCTGCCGG GAGAGTGGTG GCATCTRARA GGCTGGTCGT GGACTGTGGT TGGGGGAGGT 60 

GGGAGCTGTT TTAACCGTGT GCCCCCTCTC CTGTGCCKGC GTGGGCATCC CCCGGGGCAG 120 

TGGAACGCGG GCGCTCCTCC AGCTTCCGAG TCCAGCCAGC CTGGGCGCGG GGCGCGCCCC 180 

45 

CGAGACACCC GAGGAGTCCG TTCCTCCCTG GTTACGTGGA CTGTGGAGCT GGTCTCTTGT 240 

GGCTCAGCGC CGTGCGGAGG TTGAAGCGTA CCTGCGGAGG TCGCACCAGG GGCGTGAGGA 300 

50 GGAGGAGGAA GGGCATGAGC CGAGCTTGAG GAATCCGTGY TCCAAACTCT ACACTCAAGG 360 

RTGCMCTGCG CAACTCTGGT GGCGATGGGC TGGGGCAGAT GTCCTTGGAG TTCTACCAGA 420 

AGAAGAAGTC TCGCTGGCCA TTCTCAGACG AGTGCATCCC ATGGGAAGTG TGGACGGTCA 480 

55 

AGGTGCATGT GGTAGCCCTG GCCACGGAGC AGGAGCGGCA GATCTGCCGG GAGAAGGTGG 540 

GTGAGAAACT CTGCGAGAAG ATCATCAACA TCGTGGAGGT GATGAATCGG CATGAGTACT 600 

60 TGCCCAAGAT GCCCACACAG TCGGAGGTGG ATAACGTGTT TGACACAGGC TTCCGGG^CG 660 



SOOCtD: <WO_9839448A2Ll_> 



wo 98/39448 



440 



PCT/US98/04493 



TGCAGCXXTTA CCTGTACAAG ATCTCCTTCC AGATCACTGA TGCCCTGGGC ACCTCAGTCA 720 

CCACCACCAT GCX3CAGGCTC ATCAAAGACA CCCTTGCCXT CTGAGCX?rCG CTGGATCTCT 780 

5 

GGGAGCTCCT TGATGGCTCC CAGACCTTGG CTTTTGQGAA TTGCACTTTT GGGCCITTQG 840 

. GCTCTGGAAC CTGCTCTGGG TCATTGGTGA GACTTGGAAG GGGCAGCXTCC CGCTGGCTTC 900 

10 TTGGTTTTGT GGTTGCCAGC CTCAGGTCAT CCTTTTAATC TTTGCTGACX3 GTTCAGTCCT 960 

GCCTCTACTG TCTCTCCATA GCCCTGGTGG GGTCCCXXTTT CTTTCTCCAC TCTACAGAAG 1020 

AGCXACCACT GGGATGGGGA ATAAAGTTGA GAACATGAGT TTGGGCTGAA AAAAAAAAA 1079 

15 



20 



(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 
25 (D) TOPOIiOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TGAAGCAGGC GCTCTTGGCT CGGCGCGGCC CGCTGCAATC CGTGGAGGAA CGCGCCGCCG 60 

AGCCACCATC ATGCCTGGGC ACTTACAGGA AGGCTTCGGC TGCGTGGTCA CCAACCGATT 120 

CGACCAGTTA TTTGACGACG AATCGGACCC CTTCGAGGTG CTGAAGGCAG CAGAGAACAA 180 

35 GAAAAAAGAA GCCGGCGGGG GCGGCGTTGG GGGCCCTGGG GCCAAGAGCG CATCAGGGCC 240 

-GCGGeeeAGA- CCAACTCCAA- CGCGGCAGGC AAACAGCTGC- OCAAGGAGTC'CCAGAAAGAC 3 00 - 

CGCAAGAACC CGCTGCCCCC CAGCGTTGGC GTGGTTGACA AGAAAGAGGA GACGCAGCCG 360 

40 

CCCGTGGCGC TTTAAGAAAG AAGGAATAAG ACGAGTTGGA AGAAGACCTG ATCAACAACT 420 

TCAGGGTGAA GGGAAAATAA TTGATAGAAG ACCAGAAAGG CGACCACCTC GTGAACGAAG 480 

45 ATTCGAAAAG CCACTTGAAG AAAAC5GGTGA AGGAGGCGAA TTTTCAGTTG ATAGACCGAT 540 

TATTGACCGA CCTATTCGAG GTCGTGGTGG TCTTGGAAGA GGTCGAGGGG GCCGTGGACG 600 

TGGAATGGGC CGAGGAGATG GATTTGATTC TCCTGGCAAA CGTGAATTTG ATAGGCATAG 660 

50 

TGGAAGTGAT AGATCTTCTT TTTCACATTA CAGTGGCCTG AAGCACGAGG ACAAACGTGG 720 

AGGTAGCGGA TCTCACAACT GGGGAACTGT CAAAGACGAA TTAACTGACT TGGATCAATC 780 

55 AAATGTGACT GAGGAAACAC CTGAAGGTGA AGAACATCAT CCAGTGGCAG ACACTGAAAA 840 

TAAGGAGAAT GAAGTTGAAG AGGTAAAAGA GGAGGGTCCA AAAGAGATGA CTTTGGATGA 900 

GTGGAAGGCT ATTCAAAATA AGGACCGGGC AAAAGTAGAA TTTAATATCC GAAAACCAAA 960 

60 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



TGAAGGTCCT GATGGGCAGT. GGAAGAAGGG ATTTGTTCTT CATAAATCAA AGAC3TGAAGA 1020 

GGCTCATGCT GAAGATTCGG TTATGGACCA TCATTTCCX3G AAGCCAGCAA ATGEATATAAC 1080 

GTCTCAGCTG GAGATCAATT TTGGAGACCT TGGCCGCCCA GGACGTGGCG GCAGGGGAOG 1140 

ACGAGGTOGA CGTOGGCGTG GTGGGCGCCC AAACCGTGGC AGCAGGACCG ACAAGTCAAG 1200 

TGCTTCTGCT CCTGATCTGG ATGACCCAGA GGCATTCCCA GCTCTGGCTT AACTGGATGC 1260 

CATAAGACAA CCCTQGTTCC TTTGTGAACC CTTCTGTTCA AAGCTTTTGC ATGCTTAAGG 1320 

ATTCCAAACG ACTAAGAAAT TAAAAAAAAA AAGACTGTCA TTCATACCAT TCACACCTAA 1380 

ACACTGAATT TTATCTGTTT TAAAAATGAA CTTCTCCXXX: TACACAGAAG TAACAAATAT ' 1440 

GGTAGTCAGT TTTGTATTTA GAAATGTATT GGTAGCAGGG ATGTTTTCAT AATTTTCAGA 1500 

GATTATGCAT TCTTCATGAA TACTTTTGTA TTGCTGCTTG CAAATATGCA TTTCCAAACT 1560 

TGAAATATAG GTGTGAACAG TGTGTACCAG TTTAAAGCTT TCACTTCATT TGTGTTTTTT 1620 

AATTAAOGAT TTAGAAGTrC CCCCAATTAC AAACTGGTTT TAAATATTGG ACATACTGGT 1680 

TTTAATACCT GCTTTGCATA TTCACACATG GTCAACTGGG ACATGTTAAA CTTTGATTTG 1740 

TCAAATITTA TGCTGTGTGG AATACTAACT ATATGTATTT TAACTTAGTT TTAATATTTT 1800 

CATTTTTGGG GAAAAATCTT TTTTCACTTC TCATGATAGC TGTTATATAT ATATGCTAAA 1860 

TCTTTATATA CAGAAATATC AGTACTTGAA CAAATTCAAA GCACATTTGG TTTATTAACC 1920 

CTTCCTCCTT GCATGGCTCA TTAGGTTCAA ATTATAACTG ATTTACATTT TCAGCTATAT 1980 

TTACTTTTTA AATGCTTGAG TTTCCCATTT TAAAATCTAA ACTAGACATC TTAATTGGTG 2040 
-AAAGTTGTTT -AAACTA6OTA-TTSTTGGTAG-GeAeATCGTG-TCAAGTGA^ - -2100 " 

GGTATGGGTT TTTTCTCCCC CTTCACCAGG GTGGGTGGAA TAAGTTGATT TGGCCAATGT 2160 

GTAATATTTA AACTGTTCTG TAAAATAAGT GTCTGGCCAT TTGGTATGAT TTCTGTGTGT 2220 

GAAAGGTCCC AAAATCAAAA TGGTACATCC ATAATCAGCC ACCATTTAAC CCTTCCTTGT 2280 

TCTAAAACAA AAACCAAAGG GCX3CTGGTTG GTAGGGTGAG GTGGGGGAGT ATTTTAATTT 2340 

TTGGAATTTG GGAAGCAGAC AGCTTTACTT TGTAAGGTTG GAACAGCAGC ACTATACATG 2400 

AAATATAAAC CAAAAACXTTT TACTGTTTCT AAATTTCCTA GATTGCTATT ATTTGGTTGT 2460 

AAGTTGAGTA TTCCACAGAA AGTGGTAATT ATCTCTTCTC TCTTCCTCCA TTAGAAAATT 2520 

AGGTAAATAA TGGATTCCTA TAATGGGAGC ATCACX::ACTT ATTAAAACAC ACATAGAATG 2580 

ATGAATTAAA AAAGTTTTCT AGGATTGTCT TTTATTCTGC CACATTTATT GATAAACAGT 2640 

GAAGGAATTT TTAAAAAATT TTTAAGAATT GnTGTCACG TCATTTTTAG AAATGTTCTA 2700 

CCTGTATATG GTAATGTCCA GTTTTAAAAA TATTGGACAT CTTCAATCTT AAACAnTCT 2760 
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ATTTAGCTGA TTGGTTCTCA CATATACTTC 
TTTTTGGATA AGATTTATTA ATCTCAGTTA 
AATTGTTTTT AATGATGGAT AAACTTGTGC 
GTTCTCCACT GGTGCTAATG TCTAATATAA 
GAGATTAATT TAGTCCATAT GAACTATTGA 
ACTCCTAAAT CAGTGTGTTT AGACTTTTCA 
TGTTTTATAA ACCTCTTGAT TTCCAGCAAC 
ACAACTTCTC AGTGTCATCC ATTGCTGTCG 
TGCAAGCTTT CAAGATAGCC TGAACTTAAA 
ATAAATTTOC CTCCTAGTTC ACTTTC3TGTC 
TTATTGGTCG GTAATAACTG AAAATAAAGA 
TAAAAACAAT CAAATAGGRT CATGTTTATT 
GCACACCCCT GTGTGTTTAT AATTTTTAAA 
AAAACCAGGC TGTTGAATCA CArtTGGGAA 
TTGGGAAAGT TGAATTCTTG TAAAATACAA 
TTGTCACTTC NTTAGGCAGA TNCACTTTTT 
CTACTTTTGG TTGACAAACG GGCCTAATAG 
CCCAAGTATG G 



TAAAAGAAAC TTTTATGTTA 
CXTTACTATTC TGACATTTTA 
TGGTGTTTTG GATCTTATGA 
TTTTATATTT ACACACATAC 
CCCATTGTTC ATTGAGACAG 
AGTATCTAAC TCATTTCCAA 
ATACTATAGA AAACACCTGC 
TGAGAGACAA CATAGCAATA 
AAGTTGGTGC ATTAGTTGTA 
AAGAGCTAAA ACTGTGAACC 
TTTATTTTCA TGCTCACTTC 
GTCATGTGTT TCCTGGKTTC 
TTGAATTTTA TATGGGGTTT 
GGGTACTTAT CTTAATGACT 
AATCCAAGGA CTTCTTQGGA 
TGGATAATGG AAAGTTAAGC 
TCCGGGGGGA AATCCCTAAC 



TAAGAGTTAC 
GGAAGGAOGT 
TGCTGAGCAT 
GTGCTACCCA 
CAACATACGC 
ACATGTACCA 
TACICAAAAC 
TCTGGTATGT 
TCTGATGGAT 
TAACTTTCTC 
TTAAAAGTCA 
TGACCTGTGT 
TTATTTGCTA 
AATGACTTAA 
TTTAATCTAA 
ATACCGAATG 
NGGTAAGCaJT 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3791 



40 (2) INFORMATION FOR SEQ ID NO: 215: 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICft^ : .SEQ ID NO: 215: 

CAGTGCTCGC TCCTGCTCGG GGCGCTGCGG CCCCGQGCGT CGCCATGACC AGTGAGCTGG 60 

ACATCTTCGT GGGGAACAGA CCCTTATCGA CGAGGACGTG TATCGCCTCT GGCTCGATGG 120 

TTACTCGGTG ACCGACGCGG TGGCCCTGCG GGTGCGCTCG GGAATCCTGG AGCAGACTGG 180 

CGCCACGGCA GCGGTGCTNC AGAGCGACAC CATQGACCAT TACCGCACCT TCCACATGCT 240 

CGAGCGGCTG CTGCATGCGC CGCCCAAGCT ACTGCACCAG YTCATCTTCC AGATTCCGCC 300 

CTCCCGGCAG GCACTACTCA TCGAGAGGTA CTATGCCTTT RATGAGGCCT TTGTTCGGGA 360 
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QGTGCTGGGC AAGAAGCTGT CTAAAGGCAC CAAGAAAGAC CTGGATGACA TCAGCACXAA 420 

AACAGGCATC ACXXTTCAAGA GCTGCCX3GAG ACAGTTTGAC AACTTTAAAC GGGTCTTCAA 480 

5 

GGTGGTAGAG GAAftTGCGGG GCTCCCTGGT GGACAATATT CAC5CAACACT TCCTCCTCTC 540 

TGACCX3GTTG GCCACSGGACT ATGCAGCCAT CGTCTTCTTT GCTAACAACC GCTTTGAGAC 60O 

10 AGGGAAGAAA AAACTGCAGT ATCTGAGCTT CGGTGACTTT GCCTTCTGCG CTOAGCTCAT 660 

GATCCAAAAC TGGACCXTTTG GAGCXX3TCGA CTCACAGATG GATGACATGG ACATGGACTT 720 

AGACAAGGAA TTTCTCCAGG ACTTGAAGGA GCTCAAGGTG CTAGTGGCTG ACAAGGACCT 780 

15 

TCTGGACCTG CACAAGAGCX: TQGTGTGCAC TGCTCTCCGG QGAAAGCTGG GCGTCTTCTC 840 

TGAGATGGAA.GCCAACTTCA AGAACCTGTC CXX3GGGGCTG GTGAACGTQG CCGCCAAGCT 900 

20 GACCCACAAT AAAGATGTCA GAGACCTGTT TGTGGACCTC GTGGAGAAGT TTGTGGAACC 960 

CTGCCGCTCC GACCACTGGC CACTCAGCGA CGTGOGGTTC TTCCTGAATC AGTATTCAGC 1020 

GTCTGTCCAC TCXTCTCGATG GCTTCCGACA CCAGGCCTCT GGGACCGCTA CATGGGCACC 1080 

25 

CTCCGCGGCT GCCTCCTGCG CCTGTATCAT GACTGAGGTG CCTCCCAACG CTCCGCCXZAC 1140 

GCTGACAATA AAGTTGCTCT GAGTTTGGAG ACTGGTCCTC GCTCCGGGGA GCAAGTGGGG 1200 

30 GGCGTGCAGA TGTGCCTGTG TCTGTCTCTG AGCACCTGGT GTCCGTGTAC AAGGATGGAT 1260 

GTGTNOK3TG GCTCCTTGGG AACTGAGACA TATCTCAGGG AATGGTGTCT GTGCTCAGCC 1320 

CATCCACCAG AAGA 1334 

35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GTGGCGGGGA TGCTGCGAGG GGGTCTCCTG CCCCAGGCGG GCCGGCTGCC TACCCTCCAG 60 

ACTGTCCGCT ATGGCTCCAA GGCTGTTACC CGCCACOGTC GTGTGATGCA CTTTCAGCGG 120 

CAGAAGCTGA TGGCTGTGAC TGAATATATC CCCCCGAAAC CAGCCATCCA CCCATCATGC 180 

55 CTGCCATCTC CTCCCAGCCC CCCACAGGAG GAGATAGGCC TCATCAGGCT TCTCCGCCGG 240 

GAGATAGCAG CAGTTTTCCA GGACAACCGA ATGATAGCCG TCTGCCAGAA TGTGGCTCTG 300 

AGTGCAGAGG ACAAGCTTCT TATGCGACAC CAGCTGCGGA AACACAAGAT CCTGATGAAG 360 

60 
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RTcrrccccA accaggtcct GAAGCCCTTC CTGGAGGATT CCAAGTACCA AAATCTGCTG 420 

CCCCTTTTTG TGGGGCACAA CATGCTGCTG GTCAGTGAAG AGCCXJ^AGGT CAAGGAGATG 480 

GTACGGATCT TAAGGACTGT GCCATTCCTG CCGCTGCTAG GTGGCTGCAT TGATGACACC 540 

ATCCTCAGCA GGCAGGGCTT TATCAACTAC TCCAAGCTCC CXAGCCTGCC CCTGGTGCAG 600 

GQGGAGCTTG TAGGAGGCCT CACCTGCCTC ACAGCCCAGA CXXrACTCCCT GCTCCAGCAC 660 

CAGCCCCTCC AGCTGACCAC CTTGTTGGAC CAGTACATCA GAGAGCAACG CGAGAAGGAT 720 

TCTGTCATGT CGGCCAATGG GAAGCXMAT CCTGACACTG TTCCX3GACTC GTAGCCAGCC 780 

15 TGTTTAGCXA GCCCTGCGCA TAAATACACT CTGCGTTATT GGCTGTGCTC TCCTCAATGG 840 

GACATGTGGA AGAACTTGGG GTCGGGGAGT GTGTTTGTCA CTTGGTTTTC ACTAGTAATG 900 

ATATTGTCAG GTATAGGGCC ACTTGGAGAT GCAGAGGATT CCATTTCAGA TGTCAGTCAC 960 

20 

CGGCTTCGTC CTTAGTTTTC CCAACTTGGG ACGTGATAGG AGCAAAGTCT CTCCATTCTC 1020 

CAGGTCCAAG GCAGAGATCC TGAAAAGATA GGGCTATTGr CCCCTGCCTC CTTGGTCACT 1080 

25 GCCTCTTGCT GCAOSGGCTC CTGAGCCACC CCCTTGGGGC ACAACCTGCC ACTGCCACAG 1140 

TAGCTCAACC AAGCAGTTGT GCTGAGAATG GCACCTGGTG AGAGCCTGCT GTGTGCCAGG 1200 

CrTTGTGCTG AGTGCTGTAC ATGTATTAGT TCCTTTACTG CTGACCACAT TGTACCCATT 1260 

30 

TCACAGAGAA GGAGCAGAGA AATTAAGTGG CTTGCTCAAG GTCATGCAGT TAGTAAGTGG 1320 

CAGAACAGGG ACTTGAACCA AGCCCTCTGC TCTGAAGACC GCGTCCTGAA TTTCTTCACT 1380 

35 AGAGCTTCCT CATCAGGTTA CCCAGAAGTG GGTCCCATCC ACCATCCAGG TGTGCTTGGA 1440 
' ' ' TXOTAOTTCT CC^ ' " 1500" 

AAAATGAAAT A 1511 

40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 betse pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 
50 (D) TOPOIjOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

AGGCCTTACT TTTCCTCCCA CAAAGGAGTC GCAGCCACGC TAGCTCTGAC TTGCCACTGT 60 

GACAAAGTTC ACGTAGCAGG TCTAGGCAAA GACTGGGCAA TTGAGCAGAG GAGACGGACC 120 

TGTGAGTCTG ACCRYGAGSC GGRCCCCTTC ACCTTGGCTG GGCTGGTCCT GGTCCTTAQG 180 

60 TTTTGTCAGG TIGTCCITGT TTGGATCCCT CAACTAGGTG ATAAGCACTG GAGGGGGATG 240 
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ACCCGCCTTG GACGTGTTTC TTTAACCTCA TCCATATAAT AGGGCCX3TOG GATGGTTGTA 300 

GAGGTAAAGC AGGATGATGG TGTTTTAAGA CCAGAGCTTG GGACCAGGGC TCCTACACCT 360 

5 

AATTTTCTCT CCTQGTAGCT GAACAAAGGT CTAAATTAGC ITAACAAAAG AACAGGCTGC 420 

CGTCAGCCAG AGTTCTGAAG GCCATGCTTT CAGTTTCCCT TGTTGACAAT TGCTCTCCAG 480 

10 TTCCTATGAA AGCACAGAGC CTTAGGGGGC CTGGCCACAG AACACAACCA TCTTAGGCCT 540 

GAGCTGTGAA CAGCAGGGGG TTGTCTGTCT GTTCTGTTTC TCTGCTTGCX: GAACTTTCTC 600 

AATAAACCCT ATTTCTTATT TTATATTTAC GTNQGTGCTG GG 642 

15 



20 



(2) INFORMATIC3N FOR SEQ ID NO: 218: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 (D) TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GGTCCCACTG TTCCATTTTA TGCTAATAGA TTCCATTCTA GGGCCCAGCC GTCTCTTGAC 60 

TGATGGTC3TT CCCTTTAACC CTTGGCATGT ATAATAGAAT TTTGGTGAAT GAAAGAACCC 120 

AAATAGGCCA GATAGTCCCC CCAGGCCCTG ATATCCATAA AAGGCTTGGG AATGCATTAT 180 

35 GTAATTGTCC TTAGTCTTTT TGTTGTTTTA GAAAAAAAAA ACAAGATGGG CTCAGATGGA 240 

- - TGGCTAeGTA-AAAATGGTTC-CTAGGTGTGT-AeTeATAAeT-TTTeTTTGAA-T^ - 300-- 

AAAGGAAGGA GGAGGAAAGG AAATTAAATG TCCTTCTAGT ATTCTCTGGA CTCAAGTCTG 360 

40 

ACATATGRGA TAATAACCTA TATTGAAATG CCAAGAATTG TATCTGAAAC AAGRGAACAG 420 

TTTGACACAT TTATCATGCC TTCATATTAC ATATTAACTG AAACCAATTA ATAAACATAT 480 

45 GAAATATCCA TTGCACAAGG CAAAGGCACC TAAACCTTTT GTTTCTrTTT CTACATAGCA 540 

GAAATTGATT TTTTTTTTAT TTTTTTAGGG GAACCTATAT AATTATGACC CAGTGATGTC 600 

TTTTGGTGAC TTAAGCTTAT GAATTCAGGT TACAATTGAG TTGATTCTAG ATGGTTACTA 660 

50 

CCTTGAAAAG GATGTTGGTG CCTTATGTGA CACGAGCCAG AGCCTGCTGG GAATAAACAA 720 

AGCAGATTCA TGCCAACACC AACTCGTAGC TTTAGTGGCA GATGGGAGTG GTCACAGACT 780 

55 CCCAAAATGT GGGGCTTTGG ATTTCCACAC CATCCCACGT GTGTGTCATC TTCCTCTTTC 840 

ACACTCTTGA TGATAATTTG AAAATGRTGA AATCACCTCT GAATTTGCCT ATAGCATGAG 900 

CACATTCTTA TGACAACATA ACAAATAGTT CATAATGTGA ATATTAGAAA CTGTTACAGC 960 

60 
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CTGCAGTTAC CATAATTTTC CATGTTTGTG GAATTGATAT TGAAATAGCA GGGCTAAGGA 1020 

ATTACTGGCA AGTTTTAGCC TGTQGGTAAT ACXITTAGGGT TATTTAAATA TTTGrAATTT 1080 

TATTTAAATG TTCATGAATG TTTGAAAGGA ACAAAATTAT CAGGGATGGC TCTTTGCCAT 1140 

GGGTCTTATT TTCACCCTCT TTTCTGTAAG AAAAAAGAAC AATGTCTTAA TGTATTTTTA 1200 

AAC5TTTTTGG TATAGTTTCT AATTCCAATT TTAATAAAAG T 1241 



15 



20 



25 



30 



35 



40 
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(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEl^GTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 219: 

TGTTTATC5TG ACCTAAAACA TACACACATG CACACACACA TACATATCCA TTCATTCATT 60 

CATTCAAGTG GTGTTTCCAG TGTCTGTGTG TCACTGTTTA TGCAGTTTCC ATTTCCCAGT 120 

GAATTATGAG TGGAGGGCAA CTTTTCTAAC CAGATTGTCT TTTCAGAACA AAGACCKGGG 180 

RATTGAGGAA GAGTTTGGAA AGAGGGAGAG GCAAGGAAAG AGAGCTTTAA ATTGAAAGGT 240 

TAATTTCCTA AGAGGAACCT GGGCTGAATG ACTACAGTGT TATACCCTCC AATCTTTGCA 300 

GGTGGGCATG GAACACTGCT TGTATCACTC TGTGCACGGT ATAAATCCAT ATATCCACAA 360 

AAACACACAT CCATCCATCA ACATATACAT GGTTTGGGAT GAGCAGGTCA ATAGTTTTGA 420 

GAGGGAGTTT GTTCCTTTTT TTTTCTCATT ATACTCTTAA ATTGTTGTCA GTTATCAAAC 480 

AAACAAACAG AAAAATTGTT TGGGAAAAAC CTTGCATACG CCTTTTCTAT CMAGTGCTTT 540 

AAAATATAGA CTAAATACAC ACATCCTGCC AGTTTTTTCT TACAGTGACA GTATCCTTAC 600 

CTGCCATTTA ATATTAGCCT CGTATTTTTC TCACGTATAT TTACCTGTGA CTTGTATTTG 660 

TTATTTAAAC AGGAAAAAAA ACATTCAAAA AAAGAAAAAT TAACTGTAGC GCTTCATTAT 720 

ACTATTATAT TATTATTATT ATTOTGACAT TTTGGAATAC TGTGAAGTTT TATCTCTTGC 780 

ATATACTTTA TACGGAAGTA TTACGCCTTA AAAATACGAA AATAAATTTT ACAAGGTTTC 840 

TGTTrrGTGT GGAAGAGTAA TTGATGTTGC TAAGAATGAT GTTTGTTTTT TTGGGGTTTT 900 

TGTTGTTTTT TTTTTAAATG TTACCAGCAC TTTTTTTGTA AGTTTCACTT TCCGAGGTAT 960 

TGTACAAGTT CACACTGTTT GTGAAGTTTG AATATGAAGG AATAATTAAA AAAAAAAAAA 1020 

AAACCNCGGG GGGGGCCCGG TCCCATTGGN CCCAAGGGGG CGGTTACGGG GTCACGGCCG 1080 
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(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEXATTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 
TGAATTGAGG GCTTAAAGAT AAACATATGG GRTTGGAGTT GTGTGTCCAT AGGGTTTCAC 60 
15 TC3CCTATTTG ATTTGAGTTT ATCCCTATTA ATTTTTTACA GTGAAATTTT ATTAAAGTAT " 120 
AATGTACATA TATTTTCAGT GGATTTTGCT CTGAAGGTTC TCCAGTGGTC TGACTACGAG 180 
ATAGTGCGGC TTCAGCTGTG GGATATTGCA GGGCAGGAGC GCTTCACCTC TATGACACGA 240 

20 

TTGTATTATC GGGATGCCTC TGCCTGTGTT ATTATGTTTG ACGTTACCAA TGCCACTACC 300 
TTCAGCAACA GCCAGAGGTG GAAACAGGAC CTAGACAGCA AGCTCACACT ACCCAATGGA 360 
25 GAGCCGGTGC CCTGCCTGCT CTTGGCCAAC AAGTGTGATC TGTCCCCTTG QGCAGTGAGC 420 
CGGGASCAGA TTGACCGGTT CAGTAAAGAG AACQGTTTCA CAGGTTGGAC AGAAACATCA 480 
GTCAAGGAGA ACAAAAATAT TAATGAGGCT ATGAGAGTCC TCATTGAAAA GATGATGAGA 540 

30 

AATTCCACAG AAGATATCAT GTCTTTGTCC ACCCAAGGGG ACTACATCAA TCTACAAACC 600 
AAGTCCTCCA GCTGGTCCTG CTGCTAGTAG TGTTTGGYTT ATTTTCCATC CCAGTTCTGG 660 
35 GAGGTCTTTT AAGTCTCTTC CCTrTGGTTG CCCACCTGAC MATTTTATTA AGTACATTTG 720 

. AA1TOTCTCC_TGACTAC3^ - J7.80_ 

AACCCAKGTG CATTTCTGCA TCTCCTGGAT TAGCCTTTSA CATGTTGCTG RCTCACATTA 840 

40 

GTGCCAGTTA GTGCCTTCGG TGTAAGATCT TCTCATCAGC GCTCAATTTG TGATCCGGAA 900 
TrTTGTGAGA AGGATKAGAA ATCAGCACCT GCGTTTTAGA GATCATAATT CTCACCTACT 960 

45 TCTGAGCTTA TTTTTCCATT TGATATTCAT TGATATCATG ACTTCCAATT GAGAGGAAAA 1020 

TGAGATCAAA TGTCATTTCC CAAATTTCTT GTAGGCCGTT GTTTCAGATT CTTTCTGTCT 1080 

TGGAATGTAA ACATCTGATT CTGGAATGCA GAAGGAGGGG TCTGGGCATC TGTGGATTTT 1140 

TGGCTACTAG AAGTGTCCCA GAAGTCACTG TATTTTTGAA ACTTCTAACG TCATAATTAA 1200 

GTTTCTCTTG TCTTGGGCAT CAAGANTAGT TCCAATTTTT TGGGCCGGGG CAGQGTGG 1258 



55 



<2) INFORMATION FOR SEQ ID NO: 221: 
60 (i) SEQUENCE CHARACTERISTICS: 



tSDOCID: <WO_983944aA2J_> 



wo 98/39448 



PCT/US98/04493 



448 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(A) LEI«3TH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOIjOOT: lineair 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
CACAATATAT GAAATAGTAC CCTCTAAAAA AGAGAAAAAA AAAATCAGGC QGTCAAACTT 
AGAGCAACAT TGTCTTATTA AAGCATAGTT TATTTCACT A GAAAAAATTT AATATCAAGG 
ACTATTACAT ACTTCATTAC TAGGAAGTTC TTTTTAAAAT GACACTTAAA ACAATCACTG 
AAAACTTGAT CCACATCACA CCCTGmAT TTTCCTTAAA CATCTTQGAA GCCTAAGCTT 
CTGAGAATCA TGTGGCAACT GTGATGGGCA GTAAAATACC AGAGAAGATG TTTAGTAGCA 
ATTAAAGGCT GTTTGCACCT TTAAGGACCA GCTGGGCTGT AGTGATTCCT GGGGCCAGAG 
TGGCATTATG TTTTTACAAA ATAATGACAT ATGTCACATG TTTGCATGTT TGTTTGCTTG 
TTGAATTTTT GAACAGCCAG TTGACCAATC ATAGAAASTA TTACTTTCTT TCATATGGTT 
TTTGGTTCAC TGGCTTAAGA GGTTTCTCAG AATATCTATG GCCACAGCAG CATACCAGTT 
TCCATCCTAA TAGGAATGAA ATTAATTTTG TATCTACTGA TAACAGAATC TGGGTCACAT 
GAAAAAAAAT CATTTTATCC GTCTTTTAAG TATATGTTTA AAATAATAAT TTATGTGTCT 
GCATATTGCA GAACAGCTCT GAGAGCAACA GTTTCCCATT AACTCTTTCT GACCAATAGT 
GCTGGCACCG TTGCTTCCTC TTTGGGAAGA GGAAAGGGTG TGTGAACATG GCTAACAATC 
TTCAAATACC CAAATTGTGA TAGCATAAAT AAAGTATTTA TTTTATGCCT CAGTATATTA 
TTATTTAATT TTTTAGGTAA TGCCTATCTC TTGGrTCTATT AAGGAAAGAA GCAATCAGTA 



GAGAATTCAG GATAGTTTTG TTTAAATTCT TGCAGATTAC ATGTTTTTAC AGTGGCCTGC 
TATTGAGGAA AGGTATTCTT CYATACAACT TGTTTTAACC TTTGAGAACA TTGACAGAAA 
TTATGCAATG GTTTGTTGAG ATACGGACTT GATQGTGCTG TTTAATCAGT TTGCTTCCAA 
AGTGQCCTAC TCAAGAGGCC CTAAGACTGG TAGAAATTAA AAGGATTTCA AAAACTTTCT 
ATTCCTTTCT TAAACCTACC AGCAAACTAG GATTGTGATA GCAATGAATG GTATGATGAA 
GAAAGTTTGA CCAAATTTGT TTTTTTGTTG TTGTTGTTGT TTTGAATTTG AAATCATTCT 
TATTCCCTTT AAGAATGTTT ATGTATGAGT GTGAAGATGC TAGOGAACXTT ATGCTCAGAT 
ATTCATCGTA AGTCTCCCTT CACCTGTTAC AGAGTTTCAG ATCGGTCACT GATAGTATGT 
ATTTCTTTAG TAAGAATGTG TTAAAATTAC AATGATCTTT TAAAAAGATG ATGCAGTTCT 
GTATTTATTG TGCTGTGTCT GGTCCTAAGT GGAGCCAATT AAACAAGTTT CATATGTATT 
TTTCCAGTGT TGAATCTCAC ACACTGTACT TTGAAAATTT CCTTCCATCC TGAATAACGA 
ATAGAAGAGG CCATATATAT TGCCTCCTTA TCCTTGAGAT TTCACTACCT TTATGTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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AGTTGTGTAT AATTGTTAAA ATCTGTGAAA GAATAAAAAG TGGATTTAA?. TTA^AAAAAA 1680 
AAAAAAAAAA AAA 1693 

(2) INFORMATION FOR SEQ ID NO: 222: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

ACGCGTGGGT CGACCCACGC GTCCGCGACN TGGCGTGGTG GGGAAQGGAG AAGGATTTGT 60 

AAACCCCGGA GCGAGGTTCT GCTTACCCGA GGCCGCTGCT GTGCGGAQAC CCCCGGGTGA 120 

AGCCACCGTC ATCATGTCTG ACCAGGAGGC AAAACCTTCA ACTGAGGACT TGGGGGATAA 180 

25 GAAGGAAGGT GAATATATTA AACTCAAAGT CATTGGACAG GATAGCAGTG AGATTCACTT 240 

CAAAGTGAAA ATGACAACAC ATCTCAAGAA ACTCAAAGAA TCATACTGTC AAAGACAGGG 300 

TGTTCCAATG AATTCACTCA GGTTTCTCTT TGAGGGTCAG AGAATTGCTG ATAATCATAC 360 

30 

TCCAAAAGAA CTGGGAATGG AGGAAGAAGA TGTGATTGAA GTTTATCAGG AACAAACGGG 420 

GGGTCATTCA ACAGTTTAGA TATTCTTTTT ATTTTTTTTC TTTTCCCTCA ATCCTTTTTT 480 

35 ATTTTTAAAA ATAGTTCTTT TGTAATGTGG TGTTCAAAAC GGAATTGAAA ACTGGCACCC 540 

CATCTCTTTG AAACATCTGG TAATTTGAAT TCTAGTGCTC ATTATTCATT ATTGTTTGTT 600 

TTCAOTGrGC TGATTTTTGG TGATCAAGCC TCAGTCCCCT TCATATTACC CTCTCCTTTT 660 

40 

TAAAAATTAC GTGTGCACAG AGAG6TCACC TTTTTCAGGA CATTGCATTT TCAGGCTTOT 720 

GGTGATAAAT AAGATCGACC AATGCAAGTG TTCATAATGA CTTTCCAATT GGCCCTGATG 780 

45 TTCTAGCATG TGATTACTTC ACTCCTGGAC TGTGACTTTC AGTGGGAGAT GGAAGTTTTT 840 

CAGAGAACTG AACTGTGGAA AAATGACCTT TCCTTAACTT GAAGCTACTT TTAAAATTTG 900 

AGGGTCTGGA CCAAAAGAAG AGGAATATCA GGTTGAAGTC AAGATGACAG ATAAGGTGAG 960 

50 

AGTAATGACT AACTCCAAAG ATGGCTTCAC TGAAGAAAAG GCATTTTAAG ATTTTITAAA 1020 

AATCTTGTCA GAAGATCCCA GAAAAGTTCT AATTTTCATT AGCAATTAAT AAAGCTATAC 1080 

55 ATGCAGAAAT GAATACAACA GAACACTGCT CTTTTTGATT TTATTTGTAC TTTTTGGCCT 1140 

GGGATATGGG TTTTAAATGG ACATTGTCTG TACCAGCTTC ATTAAAATAA ACAATA 1196 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) liQIGTH: 1791 hase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TCAGGQAGGT GGCAGGAAAG GCTTGGAACA GCTGCCGGAG TGACGGAGCG GCGGCCCCGC 60 

CCGGTTGCGC TGGAGGTCGA AGCTTCCAGG TAGCGGCCCG CAGAGCCTGA CCCAGGCICT 120 
GGACATCCTG AGCCCAAGTC CCCCACACTC AGTGCAGTGA TC3AGTGCGGA AGTGAAGGTG - 180 

ACAGGGCAGA ACCAGGAGCA ATTTCTGCTC CTAGCCAAGT CGGCCAAGGG GGCAGCGCTG 240 

GCCACACTCA TCCATCAGGT GCTGGAGGCC CCTGGTGTCT ACGTGTTTGG AGAACTGCTG 300 

GACATGCCCA ATGTTAGAGA GCTGGCTGAG AGTGACTTTG CCTCTAOCTT CCGGCTGCTC 360 

ACAGTGTTTG CTTATGGGAC ATACGCTGAC TACTTAGCTG AAQCCCGGAA TCTTCCTCCA 420 

CTAACAGAGG CTCAGAAOAA TAAGCTTCGA CACCTCTCAG TTGTCACCCT GGCTGCTAAA 480 

GTAAAGTGTA TCCCATATGC AGTGTTGCTG GAGGTCTTGC CCTGCGTAAT GTGCGGCAGC 540 

TGGAAGACCT TGTGATTGAG GCTGTGTATG CTGACGTGCT TCGTGGCTCC CTGGACCAGC 600 

GCAACCAGCG GCTCGAGGTT GACTACAGCA TCGGGCQGGA CATCCAQCGC CAGGACCTCA 660 

GTGCCATTGC CCGAACCCTG CAGGAATGGT GTGTGGGCTG TRAGGTCGTG CTGTCAGGCA 720 

TTGAGGAGCA GGTGAGCCGT GCCAACCAAC ACAAGGAGCA GCAGCTGGGC CTGAAGCAGC 780 

AGATTGAGAG TGAGGTTGCC AACCTTAAAA AAACCATTAA AGTTACGACG GCAGCAGCAG 840 

CCGCAGCCAC ATCTCAGGAC CCTGAGCAAC ACCTGACTGA GCTGAGGGAA CCAGCTCCTG 900 

GCACCAACCA GCGCCASCCA GCAAGAAAGC CTCAAAGGGC AAGGGGCTCC GAGGGAGCGC 960 

CAAGATTTGG TCCAAGTCGA ATTGAAAGRA CTGTCGTTTC CTCCCTGGGG ATGTGGGGTC 1020 

CCAGCTGCCT GCCTGCCTCT TAGGAGTCCT CAGAGAGCCT TCTGTGCCCC TGGCCAGCTG 1080 

ATAATCCTAG GTTCATGACC CTTCACCTCC CCTAACCCCA AACATAGATC ACACCTTCTC 1140 

TAGGGAGGAG KCAAATGTAG GTCATGTTTT TGTTGGTACT TrCTGTTTTT TGTGACTTCA 1200 

TGTGTTCCAT TGCTCCCCGC TGCCATGCTC TCTCCCTTGT TTCCTTAAGA GCTCAGCATC 1260 

TGTCCCTGTT CATTACATGT CATTGAGTAG GTGGGTAGCC CTGATGQGGG TCGCTCTGTC 1320 

TGGAGCATAA CCCACAGGCG TTTTTTCTGC CACCCCATCC CTGCATQCCT GATCCCCAGT 1380 

TCCTATACCC TACCCCTGAC CTATTGAGCA GCCTCTGAAG AGCCATAGGG CCCCCACCTT 1440 

TACTCACACC CTGAGAATTC TGGGAGCCAG TCTGCCATGC CAGGAGTCAC TGGACATGTT 1500 
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CATCCTAGAA TCCTGTCACA CTACAGTCAT TTCTTTTCCT CTCTCTQGCC CTTOQGTCCT 1560 

^ GGGAATGCTG CTGCTTCAAC CCCAGAGCCT AAGAATGGCA GCCGTTTCTT AACATGTTGA 1620 

GAGATGATTC TTTCTTGGCC CTGGCCATCT CGGGAAGCTT GATGGCAATC CTGGAAGGGT 1680 

TTAATCTCCT TTTGTGAGTT TGGTGGGGAA GGGAAGGGTA TATA3ATTGT ATTAAAAAAA 1740 

10 AAAAGGTATA TATGCATATA TCTATATATA ATATGACGCA GAAATAAATC T 1791 



15 (2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2517 base pairs 

(B) TYPE: nucleic acid 
20 (C) STR7\NDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

25 ACACTAGTGG ATCCAAAGAA TTCGGCACAG CGGCACAGCA TTGTTGAGCT TTTCTCTOTC 60 

TGTGGGGCCC TCAAGCGAGC TCGACTGGTC CATCCTGGGG TAGCGASGTG GTGTTTGTGA 120 

AAAAGGACGA TGCCATCACC GCATAYAAGA AGTACAACAA CCGGTGTCTG GACGGGCAGC 180 

30 

CGATGAAGTG CAACCTTCAC ATGAATQGGA ATGTTATCAC CTCAGACCAG CCCATCCTGC 240 

TGCGGCTGAG TGACAGCCCA TCAATGAAAA AGGAGAGCGA GCTGCCTCGC AGGGTGAACT 300 

35 CTGCCTCCTC CTCCAACCCC CCTGCCGAAG TGGACCCTGA CACCATCCTG AAGGCACTCT 360 

TCAAGTGCTC AGGGGCCTCT . KTGACCACGC AGOCCACAGA VmX^^AAATC^ AAGCTTTGAG 420 

CAGGGGAGTR AGGCAGCCAG AAGTGGQGGC AGAGGAGGGT GGCTCTGTTT CCCCAAGGCA 480 

40 

AAGCTTATGA CCAATGGGCC ATCGGACTGG AGACCCCTGA TTGTGGGAM GGTTGCCAGG 540 

GATAAAGAGC TTCCTCACTG GATGGGACCC GCCTTTCTGT GTTGTGTTCT GCCCTGTCCT 600 

45 CTTCTCTCTA CGTTAACGTT TCCTGTAGTA TGTTTCTTCA TCTCATCGCC AAGGTAGGCT 660 

TGTGTTTTTM AGTGTGTGCC TCCCCGAGCC TCAGCCCCAA GCTGATTTCT TATCTGGAAA 720 

TGGTACACTG AATTCTCTGG GTGGCTTTCT TGTGGCCCCA TGGGATGCAG CGTGGGGGCT 780 

50 

GTCTGAAGGA CCCTGCTTTT TCCAGGGGCC GAGGGGCTGC CTTTCCTTTG TGTGTATTAA 840 

GCTTTTCAAA CAATGGAGGG GATGGAGAGC CCTGGTGTCC TGACGGGAGC CAGGTCGGCC 900 

55 TGAGAGCTGT GCCGCTCCTC TGTCTTGTCA GTGGAGGTGC CTGGGTGGGG AGCAGGTCTC 960 

AGGCCTCTTG TCCTCTCCCC AGTGGCTCCA GGCCTCACTA GTGGCAAGGG CAGGATGAGG 1020 

CTGCACCGCT GGGAAGAGTC TATCTAAGCT CTTGGCTTGG AGTCCCGTGT CGTCTCCRCC 1080 

60 
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CAGAGGAAGT TCTCCAGAGT TCACCTTTCC CTTTTCCTTG AGTTCTGCTG AATGCCCCAC 1140 

CCCAGCTCTC TTTCCCTTCT GGGTGTCTTT GCTGGGAGGG GGCTGTGTTG TGAGCCCTCC 1200 

CGGTTCTCAC CTCGCCTGGC ACTTAACCAC ACCCTGGTTT TGTGTAGCCG CCAGCTCTCT 1260 

TCTGGTTGGG CXMTTGAAAG GCTCAGCCTC CCATTGTGCA GTGCTTGGGT TTGGAGCTTA 1320 

TTTGAATGGA AGAGGTCAGT TTGTTCCTGG CTCTCCATTT CTTGGCCTCAG TTGTCTACAG 1380 

GACAGTGGTC AGGGATGCCT GGAC3GCATAT ATCCAGCTGC CACCAAGGGG CACrGTTTGT 1440 

TCCCACTTAT GTGAGTGACX: CCATCCATCC ATGACCAGAG GATTATTTTC CTGCCTTGQC 1500 

AGAGGAGGAG GAGTCAAGGG AGCAGGGCAG CTCTACCAGG CAAGGTGTTT CCCCAGCATA 1560 

GGCGCAGACA GTTGGGACGA AACTTCAGAG CCCAGGCAGT CCCTGAATGA CCAGGCCAGT 1620 

GTTGTCACTG AGTGGTCCCC TGCTGGTTGG GAGTQAAGAG AATCCAGGCT GGCAGAGCTG 1680 

GAGCCAGTTG GGGAGCACGG TTCTGGGAGC TCTQCAAAAT CAGTAGCAAG TGCTGGAAAA 1740 

GGCACATGCC GAAGATACTC AAGAGCTCCC AAGATTTGCT TGAGGCTAGC CCAGTGAAAA 1800 

AAACCAGAGA CTCATGTTTC CAGGGGTCAG TCTGTCAGGC AGGAAGGACC CAGGATTTGA 1860 

ACCCAGCTTC AGTGTGCAGG CTCTGAGGCT GCCCAGGACG QGAAAGTCCA AGGAAGGGGC 1920 

CTGGTGGTGC TCCACTTGCA GTTCTTTAAA GAATGCTGCT TTTTATTCTC CTAACCCTTT 1980 

CAAGTGGGTG CAGACTTCTC GTTAGCAGCT GGAAGACATT CCTCCCACAC TTTTCCCTTC 2040 

CTGGCCCAAG AGAGCATCCA GAAGGCAGTA GGACXTTOGTr TTTCAGGTAC TGGGAGCXX3G 2100 

GGGCTCACTG CTTGCACTGT GCTTAGGGTA GGGATGGTAA ATATCCTCCX: TGGATGGCIT 2160 

TATCerCCCT CTCATCCCAA AGCAGGTATC TTCTGGTTGT CAeAGAGTTT CATTGAGTCC 2220 

AGCTGCAGCC ACGTGGCCAT CTGGAGCTGG TGCTATAGGT GACCATCTGG TACATTGAGG 2280 

GGACCTCTTT GCCTCCTCCA CTCTATAAC5C AGTCATCTTG GGAGACCGGG AGGAGAAGGT 2340 

GGTGGGCTAG TCCTGTGTCC TCCTCCACTT CCCATGCCTC TATGTTACCC ATCTGTGTCT 2400 

CCTGTGCAGA AGGAGAGGAA GGGGCATTAA GAGATGAAGG GTGATTATGT ATTACTTATC 2460 

CATTTCTGAA TAAACATTTG TTATTCCTAA AAAAAAAAAA AAAAAACTCG AGGGGGG 2517 



50 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
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TTGTANCTAA TCGAGGATTG ATTCTAATGA CAGAGTCTTT CAACACTTTC CACATCATGT 60 

^ ATCACGAAGC TACAGCTTGC CATGTGACTG GAGATTTAGT AGAACTTCTC TCAATATTTC 120 

TTTCGGTTTT GAAGTCTACA CX3CCCTTATC TTCAGAGAAA AGATGTGAAA CAAGCATTAA 180 

TCCAGTGGCA GGAGCGAATT GAATTTGCCC ATAAACTGTT AACTCTTCTT AATTCCTATA 240 

10 GTCCTCCAGA ACTTAGAAAT GCCTGTATAG ATGTCCTCAA GGAACTTGTA CTTTTCACTC 300 

CCCATGATTT TVTTCATACT CTGGTTCCXrr TTCTACAACA CAACCATTGT ACTTACCATC 360 

ACAGTAATAT ACCAATGTCT CrTGGACCTT ATTTCCCTTTG TCRAGAAAAT ATCAAGCTAA 420 

TAGGAGGGAA AAGCAATATT CGGCXTTCCGC GCCCTGAACT CAATATGTGC CTCTTGCCCA 480 

CAATGGTCGA AACCAGTAAG GGCAAAGATG ACGITTATGA TCGTATGCTG CTAGACTACT 540 

20 TCITTTCTTA TCATCAGTTC ATCCATCTAT TATGCCGAGT TGCAATCAAC TXSTCAAAAAT 600 

TTACTGAAAC ATTAGTTAAG CTGAGTCTCC TACTTCCXTTA TGAAGGTTTG CCACTTCATC 660 

TTGCACTGTT CCCCAAACTT TGGACTGAGC TATGCCAGAC TCAGTCTGCT ATGTCAAAAA 720 

ACTGCATCAA GCTTTTGTGT GAAGATCCTG TTTTCGCAGA ATATATTAAA TGTATCCTAA 780 

TGGATGAAAG AACTTTTTTA AACAACAACA TTGTCTACAC GTTCATGACA CATTTCCTTC 840 

30 TAAAGGTTCA AAGTCAAGTG TTTTCTGAAG CAAACTGTGC CAATTTGATC AGCACTCTTA 900 

TTACAAACTT GATAAGCCAG TATCAGAACC TACAGTCTGA TTTCTCCAAC CX5AGTTGAAA 960 

TTTCCAAAGC AAGTGCTTCT TTAAATGGGG ACCTGAQGGC ACTCGCTTTG CTCCTGTCAG 1020 

35 

TACACACTCC CAAACAGTTA AACCCAGCTC TAATTCCAAC TCTGCAAGAG CTTTTAAGCA 1080 

AATGCAGGAC TTGTCTGCAA CAGAGAAACT CACTCCAAGA GCAAGAAGOC AAAGAAAGAA 1140 

40 AAACTAAAGA TGATGAAGGA GCAACTCCCA TTAAAAGGCG GCGTGTTAGC AGTGATGAGG 1200 

AGCACACTGT AGACAGCTGC ATCAGTGACA TGAAAACAGA AACCAGGGAG GTCCTGACCC 1260 

CAACGAGCAC TTCTGACAAT GAGACCAGAG ACTCCTCAAT TATTGATCCA GGAACTGAGC 1320 

45 

AAGATCTTCC TTCCCCTGAA AATAGTTCTG TTAAAGAATA CCGAATGGAA GTTCCATCTr 1380 

CGTTTTCAGA AGACATGTCA AATATCAGGT CACAGCATGC AGAAGAACAG TCCAACAATO 1440 

50 GTAGATATGA CGATTGTAAA GAATTTAAAG ACCTCCACTG TTCCAAGGAT TCTACCCTAG 1500 

CCX3AGGAAGA ATCTGAGTTC CCTTCTACTT CTATCTCTGC AGTTCTGTCT GACTTAGC1G 1560 

ACTTGAGAAG CTGTGATGGC CAAGCTTTGC CCTCCCAGGA CCCTGAGGTT GCTTTATCTC 1620 

TCAGTTGTGG CCATTCCAGA GGACTCTTTA GTCATATGCA GCAACATGAC ATTTTAGATA 1680 

CCCTGTGTAG GACCATTGAA TCTACAATCC ATGTCGTCAC AAGGGATATC TCGCAAAGGA 1740 

60 AACCAAGCTG CTTCTTGACA TTAGGTGTAG CATGTCTACT TTTAAGTCCC TCACCCCCAA 1800 
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CCCCCATGCT 
TCTCTGCTTG 
ATGAAAAAAA 
TTATGTAATT 
GGGTTGCXrTA 
GGTGTCCTCT 
CTTATTATCA 
AAGAATCCTC 
CTGTGGAGGA 
CTGATTGATT 
AAAAAAAAAA 



GTTTGTATAA 
AATGGCAAGA 
CTTGCATCTT 
AAGAGTTCAC 
GCCCTGTACA 
GTGCCAGTTT 
AAATTAAGTT 
TTTAGCTOTG 
AATAAAGCAC 
GCTATTCTGT 
AAAAAAAAAA 



GTTTTGCTTA 
TAGATTTATA 
CAGTATACTT 
TITAGTQGCT 
ATGTAATTTA 
TGTACCTTAT 
TACTTGTATA 
GCCTGGCAGT 
ACTTGATGTA 
GCACTNAATT 
AAAA 



TTTGTTTTTG 
GGCTTAATTC 
CCTAAAGGGC 
TTCATTTAAT 
AACTTACAGC 
AGAOGCAGAT 
CGGAACAACC 
ATATAAATGG 
AAAATAATTG 
AAACTGATTG 



TGCTTCAGTT 
TTGGTCAGGC 
AATCAGATAA 
ATGGCTGTCT 
ATTTTTACTG 
TGCCTCCGAT 
ACAAGAAATT 
TGCTTTATTT 
TTTTATTTTT 
TGATGACTTA 



TGTCCAGTGC 
AGAACTCCAG 
TGGATATC5TT 
GGGAAGAACA 
TGTATGATAT 
CGCTGTGGTT 
TGATTCTGTA 
AACAGAATAC 
ATTGACATGA 
AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO; 226: 
ATATAGGACG GATAATCTGT TTACATTCTG TTCTTCTCGA TGCACTCACA AGCGGGTAAC 
TAGGTGACAA GAAAACAAAG ATCTTATTCA AAAGAGGTCT TACAGCAACC CAACGTCTCA 
TCTTCCCATA GTAAAGATGA CGGCGCCTTG AGGTAAGCTA CAGGCAACAC CACTTCCGCG 
TTTCTCTTGC GCCCTGGTCC AAGATGGCGG ATGAAGCCAC GCGACGTGIT GTGTCTGAGA 
TCCCGGTGCT GAAGACTAAC GCCGGACCCC GAGATCGTGA GTTGTGGGTG CAGCGACTGA 
AGGAGGAATA TCAGTCCCTT ATCCGGTATG TGGAGAACAA CAAGAATGCT GACAACGATT 
GGTTCCGACT GGAGTCCAAC AAGGAAGGAA CTCGGTGGTT TGGAAAATGC TGGTATATCC 
ATGACCTCCT GAAATATGAG TTTGACATCG AGTTTGACAT TCCTATCACA TATCCTACTA 
CTGCCCCAGA AATTGCAGTr CCTGAGCTGG ATGGAAAGAC AGCAAAGATG TACAGGGGTG 
GCAAAATATG CCTGACGGAT CATTTCAAAC CTTTGTGGGC CAGGAATGTG CCCAAATTTG 
GACTAGCTCA TCTCATGGCT CTGGGGCTGG GTCCATGGCT GGCAGTGGAA ATCCCTGATC 
TGATTCAGAA GGGCGTCATC CAACACAAAG AGAAATQCAA CCAATGAAGA ATGAAGCCAC 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2424 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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TGAGGCAGGG CAGAGGGACC TTTGATAGGC TACGATACTA TTTTCCTGTG CATCACACTT 780 

AACTCATCTA ACTGCTTCCC CX5GACACCCT CCACCTCTAG TTGTTACTAA GTAGCTGCAG 840 

5 TACSGCATTGC TGGGGAAGAA ACAAACACAC ACCAAACAGT ACTGCTACTT AGTTTCTAAG 900 

GCTGCACAGG GAAGGGAAAG ACTGGGCTTT GGACAATCTA GAGGTAATTT ATATCCGCCC 960 

CCAGC3TGGAG CAACATGCGA TTCTGGAGGC ACGGGGGTAA CTGAAAC5TGA GTACATATAG 1020 

10 

TCTTTCTGGT TTCTGGAGAT AACCXZATCAA TAAAAGCTGC TTCCTCTGGG TAAAAAAAAG 1080 



15 

(2) INFORMA!TIC»J FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

25 

TTGCATTCAC AATTACTGGG AGGCAGGCAG GGGCAGTTGC ATGCTGGGGG TGGCTGCATG 60 

GSCTGCCASC TCTCCTGGGT TTGAAGGATG CGGTACASCT GCTTCAGCTG AGCAACGATG 120 

30 TTATCCTTGA TGTCTGGGGT TGAGATCTGC AGGCGGACAC TGCCACTATC AAAGGATCGT 180 

GTGAAATCAC CAGAAAACAT CTCGTAGATC ATCCGAGCCA CTACTGGAAT GACCTGAACC 240 

AAGATGAGTT TCCTTTCCAA TGGTTTCCCA TCTGGCCATT CTTCCCCAAA GCATAAGTAG 300 

35 

ATCTCAAACG GTGGCTGCTT CTCTATCTGT CCTTTCTGGT GGGCAATGAG ATCGCTAAGG 360 

AATGTTTCCA GACAAAATAG CTTGACCTTC TTTTGTCTCT CAATCAGGTT GGGAGCAACA 420 

40 AGTGATGGGG CACATGGCCC AGACCAGTAC ACCTTGCACT QGCACAGYCT GATGGCATAA 480 

ATGGCATGAC CGCTGACCTC CAGGATCAGT CCTCTGTCCA TGACGTCCAG CAGCTTGCTA 540 

GTGAACAGCT TCTGCTTCTC ATTQGTAATA TGCTCAGGAC CTGGGAATTT GACCTGCTCC 600 

45 

AGNCTGACGG GACCAAAGAG CTCCTCCTGG TCAGGCATGG GACCCAGGTC CCCATAGAAG 660 

AGTCGGCAGC CCTGAGGGTT GCTCACGGTC ATGGTCCTGC CCGTACTCCT TCCCACGGTA 720 

50 CTGAAACTTG ATGTCCAGGT CAGTCATTGG GAGAGAGCTG ATCCACAGTT CTGGAGAGCT 780 

ATAGAAGGRC TGTATAGGTG CCTGGGGWAC TTCCATCTCC AGGGGTTCAG TTTTGGGCCA 840 

CACTGCCTCC GGSCTGCAGT TGCCCACACT GCAATTGCCC ACACTGGCTG GCGCCATGGG 900 

55 

AGAACCATTG ATGTTCAGGA AGGGGAAGGT GTCCTGGATG GGAACATGGT GCTGCGACTG 960 

ATCCAGCTCA TCTTCCTCAT CTTCTTCATC CACATCATTA TCCTTCTCAT CCCAGGGAGC 1020 

60 AGACCCTGTG GATCCTGGGT TAATGATCGA SCCCTGGGGC TGAGGGATGT CACACACTTG 1080 
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ATATATCTTC ACTGGGTTCA TGGGCACCTC CCTTGGTGCX: ATCCATACAT CCAGGTTGAA 1140 

TTCTCTGCTC TTATTCAGAG CACAGCGCAG CTGGGCCTTC CATTTAGCTC GGTCAGGGTC 1200 

5 

ATCCACCCCT TCCTGGTACT TCCCTGTCTC TACAGCCCAG GCCTTAAAAA TGGTATTTTC 1260 

CTCTTCTTGT TGAGGGCTAT GCCGGC5TGGC ATGOTTCCAG GGAATCTGGA AGCGTTTAGA 1320 

10 GTCCCTGTGT AGCCAG 1336 



15 (2) INFORMATION FOR SEQ ID NO: 228: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2043 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDSNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22B: 

TCAGCTGGTC CCTTCCTPGT GTCCTGQGGG ACCTGCTGGC GGCCTCTTCC TGGGAGCCAT 60 

GACCTCAGAC CCCACCCACA CTCCAGATCG AGACCCCTGC CTCCCCCCGG CAAATGTCCT 120 

CCCGCTGCCT TGCAGCCTGC ACTTTGCACA TGCTCACCCC CAGCACAGTC CCACTGGCCC 180 

CTCAMCTCCC CTTCCCTGAG CTCCTTCCCA AGGACTCCTG GTCACTGCCT GCTGTGCAKT 240 

CAGAQGCCCA GGGTCCAGCA GCCCGGSGGG AACGGGTGCT GCCTSTTCCT CCAGTTAGCT 300 

CCAGYTCAGG TCTGAGACCC GTGYTGAGTA AAGGTCTGAG CAMCGACCGT GCCCTCTGCC 360 

CAGGGCTGGG TCCTGAGCAG CTGGTTTTCC TGCAGGAAGG TTGGAGCAAG CAAAGTCCTT 420 

CTCTGCCCTC AGGGTCAGCT GCCCAGACTG GGGCGGATGC AGAGAGGCAG GTGGGCTGTG 480 

GCTGGACTGG TCCGGAGCTG GCTTCCTTAC CAGAAAAGCC TCAGCCTTCC TCTGGAAGCA 540 

TCCCCCGTTC TGGGCAAGGG GGAAGGGCTC CTTTAAGGGG TGTGCTTTCC CAGTGGGGAG 600 

CAGTCTGGCC CTGCCCCCTA CTAAAGCCTC TGCTCTCAGC ACTTTCCCCC AAGTCCTTGT 660 

AACTTGCTTG AAGGTGGGTT CTGGCTOCCA GCCAGTCCCT GGACAAACTC TCCTGCCCCT 720 

TTTAAATTTC ACTCATTTTG TATAAACCCA GCAGGCTGGT GTTTACTTAG CCCTGTAGCT 780 

TTTTTCATTT TTTCTTTCCG TCTTTCTTCT TGAGTTCACG GTTCAATATT GCCTCCTCGC 840 

CCTGGTGAGG GGAGGTGCTG CTTTTCTGCC CCACCTGCCG GCTGGTTCCA GCAGCGCTGG 900 

NGCCCAGCTG GGGGGCCGGG ATGGGGGCTT CTCTCTCTGG GAGGGGTGCA GGTGCCCTCC 960 

CCAGGCTGGG AGGGTTCCTT CCCTAGCTCC CCATCTGCCC CCGCTGGTGA GAGTTGGGCT 1020 

TCTTGGTCTT GGAACTCCCT GGCATTGGGA ACAGAGCATT TCCAGCATTT GTTGTTGTTG 1080 
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TTTTACTCAC CTAACCCTTA GAAAATGAAT GTTAGAAGC?T GCCTGCCGAG GCGGGACAGA 1140 



GTGTTTGCTC GCGCTGGAGA AGGCTCTGCT CAGCCCTGAG ACTTCCCTTCC TCCCCCf^CCG 1200 
5 ATACTGGCAC TTTAAAAAGG AAGCTGACXX; CACAGTGTCC AGACGAATTG GCCCCCAGAA 1260 



GATGGGGAGT TCTGTCCTGC CCTTCTGTGT CTGCGTGACC TCACXCftGCX: TAGGAGGGAG 1320 



GTGCATTCAG GGTAGATTTG CCTTCTCATTC AAAGTTCTGG GGCTTTGGGY GGAAAACAGC 1380 

10 

CAGCTTTGGC GCTGTTGGGG AGACTCCTCC AGACCAGGAA (XCXAGAAGG AGACAGAGCC 1440 



TGCCACATCC TCCCACGCCA GGCCCTGGGC CAGGGTGATT GGACTGAGAA TTTGGCCACA 1500 
15 ACCAAATTGA TGCTGGCTGG AACCAGAGGC CAGAAAGCCT GGCCTTGTCC CCATGTOGGA 1560 



GCCCTGTCCT CAGCCCTCTT GTCCCCTTGA GCTCAGTGAA TTCCXACXAG GTGCXX:ACAG 1620 



CTCCTGGACT TCAAATTCTA TATATTGAGA GAGTTGGAGA GTATATCAGA GATAmTTG 1680 

20 

GAAAGGAGTT GGTCTATGCA ATGTCAGTTT GGAATCTTCT TGAAAGTTTA ATGITTTTAT 1740 



TAGGAGATTT AAAGT^AAATA AAGGTCTACA ATATCTTTAG G * rri ' rri ' I " XT TTTCCTGTTT 1800 
25 ACCGCACAAA CTGACCACAT GGCATGTCTA TCAGGATGGA GGGTGTCCAT GTTCTCCTCT 1860 



GTCTTTAGGG AGGTGATAAG GAGATGGSCG RAGGGGTGTT TTTTTCTTTG ACTCCCCTCC . 1920 



TTTCTAACAG AATGTTGCCA CCACTGCTTG AGTGGGCTGT GTTTGTTCCT CTGTCCCAGC 1980 

30 

TTCTGTTGTA GAAAATAACA TTGTTAGGGG AACTCAGGCT AGTGTCAGCG TCTTGGTTTG 2040 
GGG 2043 

35 

(2) INFORMATION FOR SEQ ID NO: 229: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 229: 
TAAAAAGAAG CGGGAGAATC TGGGCGTCGC TCTAGAGATC GATGGGCTAG AGGAGAAGCT 60 
50 GTCCCAGTGT CGGAGAGACC TGGAGGCCGT GAACTCCAGA CTCCACAGCC GGGAGCTGAG 120 



CCCAGAGGCC AGGAGGTCCC TGGAGAAGGA GAAAAACAGC CTAATGAACA AAGCCTCCAA 180 



CTACGAGAAG GAACTGAAGT TTCTTCGGCA AGAGAACCGG AAGAACATGC TGCTCTCTCT 240 

55 

GGCCATCTTT ATCCTCCTGA CGCTCGTCTA TGCCTACTGG ACCATGTGAG CCTGGCACTT 300 



CCCCACAACC AGCACAGGCT TCCACTTGGC CCCTTGGTCA GGATCAAGCA GGCACTTCAA 360 
60 GCCTCAATAG GACCAAGGTG CTGGGGTGTT CCCCTCCCAA CCTAGTGTTC AAGCATGGCT 420 
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TCCTGGCX3GC CCAGGCCTTG CCTCCCTGGC CTGCTGGGGG GTTCCGGGTC TCCAGAAGGA 480 
CATGGTGCTG GTCCCTCCCT TAGCXTCAAGG GAGAGGCAWT AAAGACACAA AGCTGGAAAT 540 



(2) INFORMATION FOR SEQ ID NO: 230: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

AATTGTGAAA TATTAGAATA TTGTTACTAT TTGACCCAAC TCAAAATCTC CATGGGAAAA 60 

20 

TACCTGTCGA TACCCACAGT ATTGTTGAAA ATAATCAGAT GCAGTATCAC AGCTGTGTCA 120 
GACTCTAGTA CCAGTTGGGC AATCAAGGCA CAGCTAAAAA TTGAAAACAA AGATCTGGAC 180 
25 AACAAAACAG CCAAAGGTGG GGGTCAAGAA GCTCTGACGT GTACCTAGCT GTAGAATGCT 240 
ATGCACACGT GCCAGGTGTA GTGTGCATAT CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT 300 
CAMCTCTGGT TGACCATGAG CTCTGTGTAA GCAGGAAGTG AAGGCTAAGG CAGATTTAAG 360 

30 

CTCTGAAAGC ATTCCACAAC ATACACACAA ATCGTGCAAA GCATTAAGGA AATCTTGTTA 420 
CTGCTAAGTG TTGCTGACCC AGGAACAA 448 

35 

(2) INFORMATION FOR SEQ ID NO: 231: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 



GTATGCTGCC CCAAACCAAT ATGTGTGGCT GCCTTTWACC TGACTTCTCC AACATGTAGC 60 

50 CCCAAGAGGA GGCCTCTAGA CTRAGGGAGG GGCTGGTGAC CCAGGTGTGG TGGQGCTGCA 120 

TGARACTACC AGAGAGACAG ACATTCTGGA ACTCACCCTG GGGGATCCAG TGGATCTGCC 180 

TATGGTCTGG TCCACCCCAG ACCTGTGAGA TGTTCCTCAT GAGGATGCAC TTGTGCTTCT 240 

55 

GCAAGTATTG CTGCAGCTTC ATAGTGACTC CCACCAGCAC CAGCAATACA GYTAGCTACC 300 

TGTGGCCTTG GATCTCAGCC AGCATGGCTG GGAGAGGGAG CARCTGGGCA TGTACCCTAA 360 

60 ATGCTGTTAC CAGGGAAGGA CTCCCAGAGT GAAGACAAGT AGGGACT 407 



wo 98/39448 PCT/US98/04493 
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5 (2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 
10 (c) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 232: 

15 GTAtTTGATT TCAGGCTGCT AAATGGGCTC ATTTAGCATT CATTCCTTGA TCTAGACATT 60 

AAAAAAAAAA CTGAATAGCA TTCTTTCCAG GNTAACTAAT AAAGCAGACA TC3CTAAGCCT 120 

ATAAATACAT CAGCACTGCA GCACACGTTT AAGGTTGCCA CGGACAAGGA TCACACAATA 180 

20 

GAGAACACTG TAGTTCGGTC TGCTCACAAG ACCCAGAACA TTGATCAGTr TTTGITCTTC 240 

GTTTATTATT TITCTGTrAA AAAATTGTGA AAAGnTCTT TTAGCTAGAT GATATTTTAA 300 

25 TAGCTGCGAG TGCTTTGGAA CTATAAAGAT GTCACTACTT AACACACATA CCTTATCnTT 360 

TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG TTAGCCAACC ATCTAGCATT 420 

TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGATGT TCCCATAACA GCAACAAAAC 480 

30 

AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTGT AAGCACCTGC TTAACTTTGT 540 

GTNCCAAATA TTTAGTGTGT ATATATATAT ATATATACAC ACACACACAC ATATATATTC 600 

35 AACAAATAAA GCAAAATATA ACATGCATTT CACATTITGT CTTTCCCTGT TACGATTTTA 660 

ATAGCAGAAC TGTATGACAA GTTTAGGTGA TCCTAGCATA TGTTAAATTC AAATTAATGT 720 

AAAACAGATT AACAACAACA AAGAAACTGT CTATTTGAGT GAAGTCATGC TTTCTATTAT 780 

AATAACTTGG CTrCGGTTAT CCATCAAATG CACACTTATA CTGTTATCTG 830 



40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 233: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 
50 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

CCAGAAGAAA GACCAATCTA GAATATGGAA CTCTAATCAC TTCTAGTATT TCAACTTCCT 60 

AGCAGAAATG AACTTGGCCC TAGACCTAGG GGATAAGCAA TGTTCTTTAT GTAGCCAATG 120 

60 CTACGGAAAC AAAAGAGGTG AAAGAGACCC TTTTTTTATA CTTAATGTAC ATATATTCAC 180 
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TTTTTGAGCA AGAATGCCAG AAATAGCCTT CATTTCTACC CTGCAAAATA ATCXAGATCT 240 

GCTTTCTAAA ATXSRANTCAG TTTCTAAAGT GAAACATGCA ATATTTATGC TCTGACTGAC 300 

5 

TCCTGAATTG GARGAGGAAG IWrrTCTGTT TACAGAAAAC YGTATTGTTA TATATGTCAG 360 

GCTGTGTATT GTGACTATCA GCATTCTGGT GCAAATGAAC TTTTCTCCAT CATCGACTGT 420 

10 GGAAAATTGA TACTTTTAAA GCATATTCTT CTATGAGCAC AGGTCCTCCT AGTGAAACTT 480 

AATTTGACAA AGGGTCTCAT ATGCTTTCCT AACCTGAWTT GTATTAACAT TCACAGAGCC 540 

TACATTTTCT CATTAGGGTT RTCATGCTCA GTATCTTTCC AAGTGCCAGG CAGRGCTTNC 600 

15 

CTTTTCTGAT CAAACATACC ATTTTTTGTA TTTCACAACT ATAGACAGTC ACTTCTGCAG 660 

TCCCAATTTA AAAATCCAGA ACTGCTTTAT CXMGAATCC TGAAAAATAC TGTTCTATCC 720 

20 AGGTTTCCTA AACTATAAAA GCAGATTTTG CTTTTGTTTG TTAATCATAG GCATGGCCGA 780 

GCATTGTGGA TTAGCXTTGAG GCTTAAAATC AGATGCATGT CTGGTAAGAT GACCACTGTC 840 

TCACTATCAA GAGCTTGCAG AGCCATTTTC CAGACCTGTG ATTGCXXrAGA ACACATAGTC 900 

CCCACGTTTC TAATTTGGAG CAAATCTAAA AG 932 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 234: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2786 base pairs 
35 (B) TYPE: nucleic acid 

(C) STOANDiEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

TTAGCAGGGT GAGCTGTTAA AACAGCACAC ATCTCTCATC CCCTCTTCCT TTATTCCCCC 60 

CTGGGTTTCA GAAAGGAAGG ATATATGGGG ACCACCTCCC CCTTCTTTGA TCCCAGCATC 120 

45 TCAGTCCCCC TCCCAACCCT CCATATGGCT CTCAATGGTG CTCACTTGCT TGGAAGCAGG 180 

CTCCCAATAG GGAGGGGSCT GCCCTCTACA GTCTCTTTGA CTGTAAGACA GGGCTCTGTA 240 

TCAGTGAGAC GATGAGAAAA GTCCCAGGCT AATQGCAGAA ATTTGCACTT TGAACATGTG 300 

50 

TGTTTTTGTG TTGTGGAACC TGAGATTCCT TATTTATTAA CAGGAAGTCT GATTTTTTTT 360 

TTITGGAGTC TTTGTTGCTA TATTTTGTGG GGCTGGGAGA GAGAGATTAG ATTATTTTGA 420 

55 CATGGGATCC CTTCCATAAC AGGTACTTTG AAGGCAAGAC ATAGGGTTGA AGAAGCACAA 480 

CCAGCCTCTG AAATCATAGC TCTCCAGTGG CTTTTAAAGA AAGCTGGTCC TCAGCACTAA 540 

CAAAATCACT ACAATAGCCT AGTGCTTTTT TGGAAGCCTT TTTAGGGAAG AATGTTAGGT 600 

60 



wo 98/39448 



461 



PCT/US98/04493 



TCATGGTAAC TAGTATGCTC TTTGAGATTT 
GGGTGAOGAG GGTTGTTCAG AATCTAAATT 
5 TTTCTTTTCC TTTTrTTTTG TCCCTACCAT 
GGCTCCTTGC TTTrTGTTTC TTQCTTTGCT 
AAGGACACTG CTGTTAGTGA AQGAACAAAG 

10 

AAACTGCTCT GTTTCCCCTT TAGTAACACT 
TTAATAATCC TATATAATAA TTGCTTTGGC 
15 CCTTGGGATA GAGGTTGTGT TGGGC5AATAG 
GGTAGTGTTT TTGTACAAGG TCATACCGCC 
TCAGGTAAAG AGTAATTCCT ATCCTGTGTG 

20 

AOTCATTGTT TATTGTCAGA AATGCATGAT 
AAAAAAAAAA AGAACAGTTT GTGrTTCACA 
25 CTTTTTTCCC AAAAAGAGGA GTAACAAAAT 
AACTAGTGTC AGCATTTGGG ATGCCAGGGA 
CTCAAATGCG CTATTGTTTC TTTTCAGAGT 

30 

GGGATAGAAA ATGGAATAAA GATAGAAGGG 
TTGGAGTCGA CTTTGGTATA TTGACTAGAT 
35 ACAAAAAAGT TGTCCTCCTC TCAGGTCCCT 
TTCCACACTT AGCTTTTTTG TCACACTTAT 
GTTAGTCATC AGATATTTTA GCCACCTACA 

40 

CTGAGATGGG AGAAAATGTA TTCTCCTTTC 
AGTTAGTTCT ACTAATTAGA AACTTGCTGT 
45 CCTCTTTATA GCTACCATTT GCXTTACAATA 
ATTTTTTATA GACTTTATAT TTTTCCTTTT 
GTAATTAAAC TATCTCAGCC GTTTCCCTGC 

50 

CTTCCAGGGA GCTCTTTTAA TCTTAAAGTT 
TACCTTTTTA ATAACTCTTC CCACTGCATA 
55 TGAAACTGTA GTTGAGATAC AGCTATTTAA 
KGTGGTTGCC CAAGGTTGTT TTGCGTAACT 
GGCAAACACT GGCCATGGCC GTGGGAGTAC 

60 



TTACAGTGTT GAAACTTAAG AATTTTGAGA 660 

ACAGATAGAT GATTGTOTCT TGTGAATTTG 720 

TTCCTTACAT TTCCCTTGGG GCCCATCTCT 780 

TTATCAGTTC ATTCCAGCTC CCTGTTAGTG 840 

TCTATGAGTC CTAAAATTTT AAGTCAAAGA 900 

TCTGAAGAGG AAAAACTTCA ATAGCCAAAG 960 

TTTCACCTAA AATTCTGGGC ATCACAATTT 1020 

ATTGCTTATT GCTGTTCACT GGAGAGAAAA 1080 

AGAAGCCCCA AATCCTATTT TGGCTCATCT 1140 

CCTCAGAAGC TAGAATCGAA GGCTTACCCT 1200 

GGCTCTTGGA AAGAATGACG TTTTGCTGGA 1260 

AACATGGCTT ATCAATTTTT TCAAAGAAIT 1320 

GTCATTTCTG AAAGAGGCTT ACTTTATACC 1380 

ACAGAGAGTG AGACACCTAC AATCACCAGT 1440 

GTTGCAGATT TGCCATTTCT CCATAATATG 1500 

ATGTAGAATA TGCTTTCCTG CCAACATGGT 1560 

TTGAAAATAC AAGATTGATT AGATGAATCT 1620 

TTTACACTTT 1TGACTAACT AGCATCTATA 1680 

CCTTTGTCTC CGTAAATTTC ATTTGCAGTG 1740 

CAAAAGCAAA CTGCATTTTT AAAAATCTTT 1800 

CTATACCX5CT CTCCCAACAA AAAAACAACT 1860 

ACTTTTTCTT TTCTTTTAGG GGTCAAGGAC 1920 

AATTATTGCA GCAGTTTGCA ATACTAAAAT 1980 

GATAAAGGGA TGCTGCATAG TAGAGTTGGT 2040 
TTTCCCTTCT GCTCCATATG CCTCATTGTC . 2100 

CTACATTTCA TGCTCTTAGT CAAATTCTGT 2160 

TTTCCATCTT GAArTGGTGG TTCTAAATTC 2220 

TATTTCTGGG AGATGTGCAT CCCTCTTCTT 2280 

GAGACTCCTT GATATGCTTC AGAGAATTTA 2340 

TGGGAGTAAA ATAAAAATAT CGAGGTATAG 2400 
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ACTAGCATCC ACATAGAGCA CTTGAACCTC CTTTGTACCT GTTTGGGGAA AAAGTATAAT 2460 

GAGTGTACTA CCAATCTAAC TAAGATTATT ATAGTCTGGT TCTTTGAAAT ACCATTTTTT 2520 

TCTCCTTTTG TGTTTTTCCC ACTTTCCAAT GTACTCAAGA AAATTGAACA AATGTAATGG 2580 

ATCAATTTAA AATATTTTAT TTCTTAAAAG CXTTTTTTTGC CTOTTGTAAT GTGCAGGACC 2640 

CTTCTCCTTT CATGGGAGAG ACAGGTAGTT ACCTGAATAT AGGOTGAAAA GGTTATGTAA 2700 

AAAGAAATTA TAATAAAAGG GATACTTTGC TTTTCAAATC TITOrTTTCT CTTATTCTAG 2760 

GTAAGGCATA TTAAAAATAA ATATGT 2786 



(2) INFORMATION FOR SEQ ID NO: 235: 

20 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
GGGTGCAGGA ATTCGGCACG AGAQAATGTT TGATTTTCTT TCCTATTTTA AGGATCTTCT 60 
30 CTCTTGTTGA TGTTGAAAAC TTACCTTAGT GAAGATGTGT TTCAACATGC TGTTGTCCTT 120 ' 

TACCTGCATA ATCACAGCTA TGCATCTATT CAAAGTGATG ATCTGTGGGA TAGTTTTAAT 180 
GAGGTCACAA ACCAAACACT AGATGTAAAG AGAATGATGA AAACCTQGAC CCTGCAGAAA 240 

35 

GGATTTCCTT TAGTGACTGT TCAAAAGAAA GGAAAGGAAC TTTTTATACA ACAAGAGAGA 300 
TTCTTTTTAA ATATGAAGCC TGAAATTCAG CCTTCAGATA CAAGGTACAT GCCCTCTTTC 360 
40 TTTTCATGCC ATCTCTTTTG CACTCTCAGG TGGAAATATT aTTAAGTGTT TTATAATCAT 420 
AAGTTCTTGT GAAACCTAAC AAGATTATCC CTTCCTAA 458 



(2) INFORMATION FOR SEQ ID NO: 236: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEMOESS : double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

AGGATGAAGA GGAAATTATC TCTTGGATTG CTCTCCAGGA AATCCTTCTC TATACTTTAA 60 
AAGCTCTTGT TCTTTTCTAG GARTCCAATG TGCTGATTGC TGCTAACAGT CAGGGTACAA 120 

60 



500CID: <WO_9839448A2_r.> 



wo 98/39448 



463 



PCT/US98/04493 



TTAAGGTGCT AGAATTGGTA TGAAGGGTTA ACTCAAGTCA AATTGTACTT GATCCTGCTG 180 

AAATACATCT GCAGCTGACA ATGAGAGARG AAACAGAAAA TGTCATGTCA TGTCTCTCCC 240 

5 CAAAGTCATC ATGGGTTTTG GATTTGTTTT GAATATTTTT TCTTTTTTTC TTKTCCCTCC 300 

TTTATGAGCC TTTGGGACAT TGGGAATACC CAGCCAACTC TCCACCATCA ATGTAACTCC 360 

ATGGACATTG CTCCTCTTGG TGGTGTTATC TAATTTTTGT GATAGGGAAA CAAATTCTTT 420 

10 

TGAATAAAAA TAAATAACWA AACAATAAAA GTTTATTGAG CCACAGTTGA GCTTGGAAAG 480 

TTTTTGTCAA ATCCNGCAAG AGATAACTCT TTTTANGAAG TAGCATATGT GAACTATAAT 540 

15 GTAACAGTGA ATAATTTGTA AAGTTCGTAT TTCCX:AACCT CTTTGGGAAT T 591 

20 (2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1286 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

30 TCTTTTTAAG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 60 

TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 120 

GAGATTACCT GGGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 180 

35 

CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAOGTCCTT TCTGAAAGAT CTGCTACTGA 240 

AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACTCA 300 

40 CCTTCCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 360 

TCCACCACCG GGTTTTCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 420 

AGAAAGTGGA CATTCCTCTG GTTATGATAG TSGTTCTGCA CGTGCATTTC CATATGGCAA 480 

45 

TGCGATGAAG AACGATACAG ATACAGGGAA TATGCAGAAA GAGGTTATGA GCGTCACAGA 540 

GCAAGTCGAG AAAANGAAGA ACGACATAGA GAAAGACGAC ACAGGGAGAA AGAGGAAACC 600 

50 AGACATAAGT CTTCTCGAAG TAATAGTAGA CGTCGCCATG AAAGTGAAGA AGGAGATAGT 660 

CACAGGAGAC ACAAACACAA AAAATCTAAA AGAAGCAAAG AAGGAAAAGA AGCGGGCAGT 720 

GAGCCTGCCC CTGAACAGGA GAGCACCGAA GCTACACCTG CAGAATAGGC ATGGTTTTGG 780 

55 

CCTTTTGTGT ATATTAGTAC CAGAAGTAGA TACTATAAAT CTTGTTATTT TTCTGGATAA 840 

TGTTTAAGAA ATTTACCTTA AATCTTGTTC TGTTTGTTAG TATGAAAAGT TAACTTTTTT 900 

60 TCCAAAATAA AAGAGTGAAT TTTTCATGTT AAGTTAAAAA TCrTTGTCTT GTACTATTTC 960 
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AAAAATAAAA AGACAGCAAT GACTTTATAT CCAAGAAAGG AATGTGAATG AGTCACTTAA 1020 

^ CAGGGAATCT AAAGAGCTGT GTTAGCTC3TG TACATACACA GATTATCTGA GAAAAGGTCA 1080 

AGGGTTCCAC TTGC3GCCACA GTTTTTTTGT TAATCAAACA CCACTCTCTT AAGRGGCTCC 1140 

ATCACAAARG GCAACCAARG GGCCCCTCTT ARGGCTTTGA GGATTAAAAC TAGTCTTTAT 1200 

10 CTATTACTGC TGTGGACACT CTTGGCTTRG TATWTTTAGG GGGCSOTCCTT ACCTTTTTTT 1260 

GGTTTTCCNC ACCTTTTTGG TTGGGC 1286 

15 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 8: 

ATGGCAGCGC AGAAGGACCA GCAGAAAGAT GCCGAGGCGG AAGOGCTGAG CGGCACGACC 60 

CTGCTGCCGA AGCTGATTCC CTCCGGTGCA GGCCGGGAGT GGCTGGAGCG GCGCCGCGCG 120 

30 

ACCATCCGGC CCTGGAGCAC CTTCGTGGAC CAGCAGCGCT TCTCACGGCC CCGCAACCTC 180 

GGAGAGCTGT GCCAGCGCCT CGTACGCAAC GTGGAGTACT ACCAGAGCAA CTATGTGTTC 240 

35 GTGTTCCTGG GCCTCATCCT GTACTGTGTG GTGACGTCCC CTATGTTGCT GGTGGCTCTG 300 

GCTGTCTTTT TCGGGGCCTG TTAACATTCT CTATCTGCGC ACCTTGGAGT CCAAGCTTGT 360 

CCTCTTTGGC CGAAAGGTGA GCCCAGCGCA TCATATGCTC TGGCTGGAGG CATCTCCTTC 420 

40 

CCCTTCTTCT GGCTGGCTGG TGCQGGCTCG GCCGTCTTCT GGGTGCTGGG AGCCACCCTG 480 

GTGGTCATCG GCTCCCACGC TGCCTTCCAC CAGATTGAGG CTGTGGACGG GGAGGAGCTG 540 

45 CAGATGGAAC CCGTGTGAGG TGTCTTCTGG GACCTGCCGG CCTCCCGGGC CAGCTGCCCC 600 

ACCCCTGCCC ATGCCTGTCC TGCACGGTCT GCTGCTCGGG CCCACAGCGC CGTCCCATCA 660 

CAAGCCCGGG GAGGGATCCC GCCTTTGAAA ATAAAGCTGT TATGGGTGTC ATTCAAAAAA 720 

50 

AAAAAAAAAA AAAA 734 



55 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 
60 (B) TYPE: nucleic acid 



35 



(2) INTORMATION FOR SEQ ID NO: 240: 
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(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

^ <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

CGGGGTCTTC AC3GGTACCGG GCTGCSTTACA GCAGCTCTAC CCCTCACGAC GCARACATGG 60 

CAGCGCAGAA GGACCAGCAG AAAGATGCCG AGGCGGAAGG GCTGAGCGGC ACGACCCTCC 120 

10 TGCCGAAGCT GATTCCCTCC GGTGCAGGCC GGGAGTGGCT GGAGCGGCGC CGCGCGACCA 180 

TCCGGCCCTG GAGCACCTTC GTGGACCAGC AGCGCTTCTC ACGGCCCCGC AACCTGGGAG 240 

AGCTGTGCCA GCGCCTCGTA CGCAACC5TGG AGTACTACCA GAGCAACTAT GTGTTCGTCT 300 

TCCTGGGCCT CATCCTGTAC TGTGTGGTGA CGTCCCCTAT GTTGCTGGTG GCTCTGGCTG 360 

TCTTTTTCGG CGCCTGTTAC ATTCTCTATC TGCGCACCTT GGAGTCCAAG CTTGTGCTCT 420 

20 TTGGCCGAGA GGTGAGCCCA GCGCATCAGT ATGCTCTGGC TGGAGGCATC TCCTTCCCCT 480 

TCTTCTGGCT GGCTGGTGCG GGCTCGGCCG TCTTCTGGGT GCTGGGAGCC ACCCTGGTGG 540 

TCATCGGCTC CCACGCTGCC TTCCACCAGA TTGAGGCTGT GGACGGGGAG GAGCTGCAGA 600 

25 

TGGAACCCGT GTGAGGTGTC TTCTGGGACC TGCCGGCCTC CCGGGCCAGC TGCCCCACCC 660 

CTGCCCATGC CTGTCCTGCA CGGCTCTGCT GCTCGGGCCC ACAGCGCCGT CCCATCACAA 720 

30 GCCCGGGGAG GGATCCCGCC TTTGAAAATA AAGCTGTTAT GGGTGTCATT CAGGAAAAAA 780 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 809 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

TCGACCCACG CGTCCGGCAA CATGGCGGCT GCCGTGGTGC AGCGCCCGGG CTGAGCGACA 60 

GCAAGTGCAG CGGGCTCCTA CCCCGGGTGA GGGGTGGCCT CCGCGTGGGA TCGTGCCCTC 120 

50 

TTCAGCCCGC TCCTGTCCCC GACATCACGT GTATTCCGCA CGTCCCCTCC GCGCTGTGTG 180 

TCTACTGAGA CGGGGAGGCG TGACAGGGCC CGGGTCCCTT CTCAGTGGTG CTCTGTGCTT 240 

55 CAGGGCAAGC TCCCCGTCTC CGGGCGCACT TCCCTCGCCT GTGTTCGGTC CATCCTCCTT 300 

TCTCCAGCCT CCTCCCCTCG CAGGCGGATG AMCCGGACGA CGGGCCAGTG CCTGGCACCC 360 

CGGGGTTGCC ARGGTCCAMG GGGAACCCGA AGTCCGAGGA GCCCGARGTC CCGAACCAGG 420 

60 
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ARGGGCTGCA GCGCATCAMC GGCCTGTCTC 
TGTGCTACAT CAATCTCCTG AACTACATGG 
5 ACATCGAGCA GTTCTTCAAC ATCGGGGACA 
TCTCCAGTTA CATGGTGTTG GCACCTGTGT 
AGTATCTCAT GTGCXX3GGGC ATTGCCTTCT 

10 

TCCCCGGAGA GCATTTCTGG CTGCTCCTCC 
CCAGTTATTC CACCATCGCG CCCACTCTCA 
15 CCX3GATGCTC AGCATCTTCT ACTTTGCCAT 
AGGCTCCAAA GTGAAGGATA TQGCTGGAGA 
TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT 

20 

CGTGGAGCGC CACTCAGATT TGCCACCXTCT 
GGCTCTGGCA AGAAATCCTA GTTTCX?rCCT 
25 TGTCACGGGC TCCCTGGCTC TGTGGGCTCC 
TGGGGAGACC CCACCCTGCC TTCCCGGAGA 
TGGACTCATC ACCTGCCTGA CCX3GAGTCCT 

30 

CCX3GCTCCGC CACTCCAACC CCCGGGCTGA 
CTCTGCACCC TTCCTCTTCX: TGTCCCTTGC 
35 TATTTTMATC TTCATTGGAG AGACCCTCCT 
TCTGCTGTAC GTGGTGATCC CTACCCGACG 
GTCCCACXTTG CTGGGTGATG CTGGGAGCCC 

40 

GCGCCX3GAAC TGGCCCCCCT CCTTCTTGTC 
GCTCTGCGCG TTTGTTGGGG CACTGGGCX3G 
45 GAGGCCGACC GCXTGCCGGGC ACAGCTGCAC 
ACAGACGACC GGATTGTGGT GCCCCAGCGG 
GTGCTCATCT GAGARGCTGC CGCTCACCTA 

50 

cx:accccacg aagggcxtog gcctaacccc 
gggccgtgtg ccagctccca gacactacot 
55 ggagggggat ccctctccac aggggcagcc 
aaaantcta gccagacxxtc agctgcctgc 

TGCACCCXTGT CTTCACCCCA GGGCTCCTGA 

60 



CCX5GCCGTTC GGCTCTCATA GTGGCGGTX3C 480 

ACCGCTTCAC CX3TGGCTGGC GTCCTTCCCX5 540 

GTAGCTCTGG GCTCATCCAG ACCGTGTTCA 600 

TTGGCTACCT GGGTGACAGG TACAATCGGA 660 

GGTCCCTGGT GACACTGGGG TCATCCTTCA 720 

TGACCXX3GGG CCTGGTGGGG GTCGGGGAGG 780 

TTGCCGACCT CTTTGTGGCC GACCAGCXX3A 840 

TCCGGTGGGC AGTGGTCTGG GCTACATTGC 900 

CTGGCACTGG GCTCTGAGGG TGACACCGGG 960 

CCTQGTAGTG CGGGAGCCGC CAAGGGGAGC 1020 

GAACX:CCACC TCGTGGTGGG CAGATCTGAG 1080 

GTCTTCCXTTG GGCTTCACTG CTGTGGCCTT 1140 

GGCATTCCTG CTGCGTTCCC GCGTGGTCCT 1200 

CTCCTGCTCT TCCTCTGACA GTCTCATCTT 1260 

GGGTGTGGGC CTGGGTGTGG AGATCAGCCG 1320 

TCCCCTGGTC TGTGCCACTG GCCTCCTGOG 1380 

CTGCX3CCCGT GGTAGCATCG TGGCCACTTA 1440 

GTCCATGAAC TGGGCCATCG TGGCCGACAT 1500 

CTCCACCGCC GAGGCCTTCC AGATCGTGCT 1560 

CTACCTCATT GGCCTGATCT CTGACCGCCT 1620 

CX3AGTTCCGG GCTCTGCAGT TCTCGCTCAT 1680 

CXXy^CTTTCC TGGGCACCGC CATCTTCATT 1740 

GTGCAGGGCC TGCTGCACGA AGCAGGGTCC 1800 

GGCCX3CTCCA CCCGCGTGCC CGTGGCCAGT 1860 

CCTGCACATC TGCCACAGCT GGCCXTTGOGC 1920 

TTGGCCTGGC CXy^XTTTCCA GAGGGACXXT 1980 

GGGTAGCTCA GGGGAGGAGG TGGGGOTCCA 2040 

CCAAGGGCTC GGTGCTATTT GTAACGGAAT 2100 

TCTCGTCTTT CTCTGGGTGG CCTCTGATCT 2160 

AGACTGTGGG T 2201 
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(2) INFXDRMATION FOR SEQ ID NO: 241: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1661 bcise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
10 (D) ^TOPObOCT: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

GTCCTTCCCG ACATCGAGCA GTTCITCAAC ATCGGGGACA GTAGCTCTGG GCTCATCCAG 60 

ACCGTGTTCA TCTCCAGTTA CATGGTGTTG GCACCTGTGT TTGGCTACCT GGGTGACAGG 120 

TACAATCGGA AGTATCTCAT GTGCGGGGGC ATTGCCITCT GGTCCCTGGT GACACTGGGG 180 

20 TCATSCrrCA TCCCCGGAGA GCATTTCTGG CTGCTCCrCC TGACCCGGGG CCTCGTC3GGG 240 

GTCGGGGAGG CCAGTTATTC CACCATCGCG CCCACTCTCA TTGCCGACCT CTTTGTGGCC 300 

GACCAGCGGA SCGGATGCTC AGCATCTTCT ACTTTGCCAT TCCGGTCGGO-'AGTGGTCTGG 360 

GCTACATTGC AGGCTCCAAA GTGAAGGATA TGGCTGGAGA CTGGCACTGG GCTCTCAGGG 420 

TGACACCGGG TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT CCTGGTAGTC CGGGAGCCGC 480 

30 CAAGGGGAGC CGTGGAGCGC CACTCAGATT TGCCACCCCT GAACCCCACC TCGTCGTCGG 540 

CAGATYTGAG GGCTCTGGCA AGAAATCCTA GTTTCGTCCT GTCTTCCCTG GGCTTCACTG 600 

CTGTGGCCTT TGTCACGGGC TCCCTGGCTC TGTGGGCTCC GGCATTCCTG CTGCGTTCCC 660 

35 

GCGTGGTCCT TGGGGAGACC CCACCCTGCC TTCCCGGAGA CTCCTGCTCT TCCTCTGACA 720 

GTCTCATCTT TQGACTCATC ACCTGCCTGA CCGGAGTCCT GGGTGTGGGC CTGGGTGTGG 780 

40 AGATCAGCCG CCGGYTCCGC CACTCCAACC CCCGGGCTGA TCCCCTGGTC TGTCCCACTC 840 

GCCTCCTGGG CTCTGCACCC TTCCTCTTCC TGTCCCTTGC CTGCGCCCGT GGTAGCATCG 900 

TGGCCACTTA TATTTTCATC TTCATTGGAG AGACCCTCCT GTCCATGAAC TCGGCCATCG 960 

45 

TGGCCGACAT TCTGCTGTAC GTGGTGATCC CTACCCGACG CTCCACCGCC GAGGCCTTCC 1020 

AGATCGTGCT GTCCCACCTG CTGGGTGATG CTGQGAGCCC CTACCTCATT GGCCTCATCT 1080 

50 CTGACCGCCT GCGCCGGAAC TGGCCCCCCT CCTTCTTGTC CGAGTTCCGG GCTCTCCAGT 1140 

TCTCGCTCAT GCTCTGCGCG TTTGTTGGGG CACTGGGCGG CGCACTTTCC TGGGCACCGN 1200 

CATCITCATT GAGGCCGACC GCCGGCGGGC ACAGCTGCAC GTGCAGGGCC TGCTGCACGA 1260 

AGCAGQGTCC ACAGACGACC GGATTGTGGT GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC 1320 

CGTGGCCAGT GTGCTCATCT GAGAGGCTGC CGCTCACCTA CCTGCACATC TGCCACAGCT 1380 

60 KGCCCTGGGC CCACCCCACG AAGGGCCTGG GCCTAACCCC TTGGCCTGGC CCAGCTTCCA 1440 
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GAGGGACCCT GGGCCGTGTG CCAGCTCCCA GACACTACWr GGGTAGCTCA GGGGAGGAOG 1500 

TGGGGGTCCA GGAGGGQGAT CCCTCTCTAC AGGGGNCACC CCAAGGGCTC GGTGCTATTT X560 

GTAACGGAAT AAAATTTOTA GCCAGACXXX: AGGTGCCTGC TCTCGTCTTT CTCTGGGTGG 1620 

CCTCTGATCT TGCACCCCX5T CTTCACCCCA GGGCTCCTGA A 1661 

(2) INFOHMATIOtN FOR SEQ ID NO: 242: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUEt«:E DESCRIPTION: SEQ ID NO: 242: 

NGACAGAAAA GCAGAAGATG AGACTCTGTT CATTCACTTT TCCTAGGCCC ATCCTGTGGT 60 

25 CATCTTTCCC CCTCCCATCA TACCTCCTCC TTCCTGGAGC CTCTQCCGGC TTGGCTGTAA 120 

TGGTGGCACT TACCTGGATA TTTCAGTGGG AGGATGAAAG GCGAGACTCA CCCTACGCGG 180 

TGGGACAGAT GGGGAGAGGA AAAAGGCAGA GATiSGCCAGG AGAGGGGTGC AGGACAAACC 240 

30 

AGAGAGGTTG GGTCAGQGGA AAAGTGTNGG GAGAAAGTGG GGTGCAGGCC CTGCAGGCCG 300 

GTTTAGCCAG CAGCTGCGGC CTCCCCGGGC CCTTGGCATC CAACTTCGCA GACAGGGTAC 360 

35 CAGCCTCCTG GTGTGTATCA TAGGATTTGT TCACATAGTG TTATGCATGA TCTTCGTAAG 420 

GTTAAGAAGC CGTGGTQGTG CACCATGACA TCCAA^XXTCT ATATATAAAG ATAAATATAT 480 

ATATATATGT ATGTAAATTA T7KGCACTGAG GGCCCTGCTG CCCTGCTGGA CCAAGCAAAA 540 

40 

CTAAGCCTTT TGGTTTGGGT ATTATGTTTC GTTTTGTTAT TTGTTTGTTT TTGTGGCTTG 600 

TCTTATGTCG TGATAGCACA AGTGCCAGTC GGATTGCTCT GTATTACAGA ATAGTGTTTT 660 

45 TAATTCATCA ATGTTCTAGT TAATGTCTAC CTCAGCACCT CCTCTTAGCC TAATTTTAGG 720 

AGGTTGCCCA ATTTTGTTTC TTCAATTTTA CTGGTTACTT TTTTGTACAA ATCAATCTCT 780 

TTCTCTCTTT CTCTCCTCCC CACCTCTCAC CCTTGCCCTC TCCATCTCCC TCTCCCGCCC 840 

50 

TCCCCTCCrC CCTCTGGCTC CCCGTCTCAT TTCTGTCCAC TCCATTCTCT CTCCCTCTCT 900 

CCTGCCTCCT GCTGCCCCCT CCCCAGCCCA CTTSCCCGAG TTGTGCTTGC CGCTCCTTAT 960 

55 CTGTTCTAGT TCCGAAGCAG TTTCACTCGA AGTTGTGCAG TCCTGGTTGC AGCTTTCCGC 1020 

ATCTGCCTTC GTTTCGTGTA GATTGACGCG TTTCTTTGTA ATTTCAGTGT TTCTGACAAG 1080 

ATTTAAAAAA AAAAAAAGGA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC 1140 
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CAATTG 



1146 



10 



15 



20 



25 



30 
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40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

AACCCACGGC TGCIXSCGGCA GGGCGTGGAG GGCAGAGGGC CGCGGAGGCG CAGTTCCAAA 60 

CATGGCTCAG AGCAGAGACG GCGGAAACCC GTTCGCCGAG CCCAGCGAGC TTGACAACCC 120 

CTTTCAGCCA CCACCAGCCT ATGAGCCTCC AGCCCCTGCC CCATTGCCTC CACCCTCAGC 180 

TCCCTCCTTG CAGCCCTCGEA GAAAGCTCAG CCCCACAGAA CCTAAGAACT ATCGCTCATA 240 

CAGCACTCAG GCCTCAGCTG CAGCAGCCAC AGCTGAGCTG CTGAAGAAAC AGGAGGAGCT ioO 

CAACCGGAAG GCAGAGGAGT TGGACCGAAG GAGNCGAGAG CTGCAGCATG CTGCCCTGGG 360 

RGGCACAGCT ACTCGACAGA ACAATTGGCC CCCTCTACCT TCTTTTTGTC CAGTTCAGCC 420 

CTGCTTTTTC CAGGACATCT CCATGGAGAT CCCCCAAGAA TTTCAGAAGA CTGTATCCAC 480 

CATGTACTAC CTCTGGATGT GCAGCACGST GGCTCTTCTC CTGAACTTCC TCGCCTGCCT 540 

GGCCAGCTTC TGTGTGGAAA CCAACAATGG CGCAGGCTTT GGGCTTTCTA TCCTCTGGGT 600 

CCTCdTTTC ACTCCCTGCT CCTTTGTCTG CTGGTACCGC CCCATGTATA AGGCTTTCCG 660 

GAGTGACAGT TCATTCAATT TCTTCGTTTT CTTCTTCATT TTCTTCGTCC AGGATGTCCT 720 

CTTTGTCCTC CAGGCCATTG GTATCCCAGG TTGGGGATTC AGTGGCTGGA TCTCTGCTCT 780 

GGTGGTGCCG AAGGCAACAC AGCAGTATCC GTGCTCATGC TGCTGGTCGC CCTGCTCTTC 840 

ACTGGCATTG CTGTGCTAGG AATTGTCATG CTGAAACGGA TCCACTCCTT ATACCGCCGC 900 

ACAGGTGCCA GCTTTCAGAA GGCCCAGCAA GAATTTGCTG CTGGTGTCTT CTCCAACCCT 960 

GCGGTGCGAA CCGCARCTTG CCAATGCAGC CGCTGGGGCT GCTGAAAATC CCTTCCGGGC 1020 

CCCGTGACCC CTGACTQGGA TGCCCTGGCC CTGCTACTTG AGGGAGCTGA CTTAGCTCCC 1080 

GTCCCTAAGG TCTCTGGGAC TTGGAGAGAC ATCACTAACT GATGGCTCCT CCGTAGTCCT 1140 

CCCAATCCTA TGGCCATGAC TGCTGAACCT GACAGGCGTG TGGGGAGTTC ACTOTGACCT 1200 

AGTCCCCCCA TCAGGCCACA CTGCTGCCAC CTCTCACACG CCCCAACCCA GCTTCCCTCT 1260 

GCTGTGCCAC GGCTGTTGCT TCGGTTATTT AAATAAAAAG AAAGTGGAAC TCGAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAG GGGGCajCCNC 1350 
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5 (2) INFORMATI(»J FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

15 TCCCAGAQCX: CGGGGGGTTC CAGCTCTGCC TGTAGCAGAG CCCTGAGGAG GAGGAGGAAG 60 

AGGATGTGCT GAAATACGTC CGGGAGATCT TTTTCAGCTA GGGCATAAAC TGTGCACTGA 120 

ACTGTCTGCC GAGAGCAGCT GGAGGACAGC TGAGCTTCCA CTGGTGCTGC TGGGCCGMCC 180 

20 

GCCTGTGGGA ATGGGGCTCT CTGTGCTCCT ACCTTTGTGC CTTCTTGGGC CTGGCAGATT 240 

CACCTCAGGC CAGAAGCCCC TGGACACTCC GGGCCTTGGG GTGCCGTTCT GAGTGTCCGG '300 

25 AAGGCAGGAC TCAAAATC5AG ATCCCATTTG ACTCCCTCTG TATGTACTGT GCCCTCTCCT 360 

GGCTCTTGAG GCTCTGGAGT CCCAATTGTC TGTGTTAGTC AGTGACCAQG TTCCAGGGAA 420 

AATRATGTCA TGTGGTGGTC CAACTTACTG GAACCAAAGA GACAGTACTT TGCAAAGAAA 480 

30 

AGGATCACTG CCAGGTGCAC TGGAATTGCT ACAGTTTAGT CCGCATGATC TCTCCTGAAG 540 

GAGGAAGCCT GTTTCAAAAA TAGTTTCCAT CATGAGTCTA TCAATGAGCT CCCACCTCTC 600 

35 CAGCCAGCCT AGAAAGCAAA CGAGCTGCCC ACAGTTCTCT GCCCTGTCTG GGAGGTTGAG 660 

GCCACAGTGT ATAGACTGGT AAGCCAGACA GGCCTCCTCC CGCAAGCTGC TACCTTGCTT 720 

TCACCTGTAC CTTGGTCCCC GGGCAGCTAG CTATAAAGCA AGAGQGACAG GAGCCCAGAA 780 

40 

GAGACACTGA GGACAAGAGA TCACACCAGA GTACATGTCT CTGCCTCTGT TTTCAGTGTG 840 

GCTTTGGACA GGAATATATG AATAAATCAC TGCCATACAG GTTTTCCAAT ACACAAGTGC 900 

45 TAGAAAATAC ACACAATTCC CCAATGCGTA AGTTGTGCTA ATGTCTTTCC AAGTTCTGGG 960 

TTGGGAAGTG GAGGGTGGCA GCGTTTGTTT GTGCGCAACC GTCCAGTCCT GTTCACAGCG 1020 

AGGATTTGGA GTCCTCCAGG GTCTCATCAT GGGAGTGATT TGTCAGCGGA CGCCTCTGCC 1080 

50 

CTGTCTGGCT TCAGGTCCAG GGAAGCTTTG AAGCAGTCAA GCCTTGTCTT TGTACCCCAT 1140 

GTGTCCTGTC TTTGTTGAGT CACTCAGAGA TCACTCCTGG ACCTCTGGGG TTGGAGTTCC 1200 

55 AGTGATGGCT TATGGCGGCC CACTCACTAT GGTGGGCTGA GTGGAAGCTC CTTAACCATG 1260 

TCCCCAGAGA CACTGAGGTG CTCGCTCTTT TAATGTCCTC GTTTGTTGCC GTAAGTTCTT 1320 

TGCTAGGTTT CATTTTGGCA TTTGGCAAAT CAGCCTGGAA GTCTGGCCCC ATGACAGCAA 1380 
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TCACTCCCTC CCCACCXTTCC TGAAGCTAGA GGAAGATTTG CTCAGATCCA TTAATTAAAG 1440 

CAGGAATTGG TGTTGACAATG AGCTGCATGG TTTAGGGAGT CTTTGGGAGC CTTGGAAGTC 1500 

5 CTGAAGGACA AACAATCTTG TACTAAGAA 1529 



10 (2) INFORMATION FOR SEQ ID NO: 245: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

GTGCGAGGTC CCCGCCAGCC CCCAGCGGCC TTCCCGQCCC GGGGCGCTCC CAGAGCAAAC 60 

GAGGCCCCTG AGAGCTCCAC CTAGTTCACA GGATAAAATC CCACAGCAGA ACTCGGAGTC 120 

AGCAATGGCT AAGCCCCAGG TGGTTGTAGC TCCTGTATTA ATGTCTAAGC TGTCTGTGAA 180 

TGCCCCTGAA TTTTACCCTT CAGGTTATTC OTCCAGTTAC ACAGAATCCT ATGAGGATGG 240 

TTGTGAGGAT TATCCTACTC TATCAGAATA TGTTCAGGAT TTTTTGAATC ATCTTACAGA 300 

GCAGCCTGGC AGTTTTGAAA CTGAAATTGA ACAGTTTGCA GAGACCCTGA ATGGTTGTGT 360 

TACAACAGAT GATGCTTTGC AAGAACTTGT GGAACTCATC TATCAACAGG CCACATCTAT 420 

CCCAAATTTC TCTTATATGG GAGCTCGCCT GTGTAATTAC CTGTCCCATC ATCTGACAAT 480 

TAGCCCACAG AGTGGCAACT TCCGCCAATT GCTACTTCAA AGATGTCGGA CTGAATATGA 540 

AGTTAAAGAT CAAGCTGCAA AAGGGGATGA AGTTACTCGA AAACGATTTC ATGCATTTGT 600 

ACTCTTTCTG GGAGAACTTT ATCTTAACCT GGAGATCAAG GGAACAAATG GACAGGTTAC 660 

AAGAGCAGAT ATTCTTCAGG TTGGTCTTCG AGAATTGCTG AATGCCCTGT TTTCTAATCC 720 

TATGGATGAC AATTTAATTT GTGCAGTAAA ATTGTTAAAG TTGACAGGAT CAGTTTTGGA 780 

AGATGCTTGG AAGGAAAAAG GAAAGATGGA TATGGAAGAA ATTATTCAGA GAATTGAAAA 840 

CGTTGTCCTA GATGCAAACT GCAGTAGAGA TGTAAAACAG ATGCTCTTGA AGCTTCTAGA 900 

ACTCCGGTCA AGTAACTGGG GCAGAGTCCA TGCAACTTCA ACATATAGAG AAGCAACACC 960 

AGAAAATGAT CCTAACTACT TTATGAATGA ACCAACATTT TATACATCTG ATGGTGTTCC 1020 

TTTCACTGCA GCTGATCCAG ATTACCAAGA GAAATACCAA GAATTACTTG AAAGAGAGGA 1080 

CTTTTTTCCA GATTATGAAG AAAATGGAAC AGATTTATCC GGGGCTGGTG ATCCATACTT 1140 

GGATGATATT GATGATGAGA TGGACCCAGA GATAGAAGAA GCTTATGAAA AGTTTTGTTT 1200 

GGAATCAGAG CGTAAGCGAA AACAGTAAAG TTAAATTTCA GCATATCAGT TTTATAAAGC 1260 
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AGTTTAGGTA TGGTGATTTA GCAGAACACA 
CAAATTRAGG ATGTTGAGTT ATGTTACTAA 
TCTGCCAAAA TAAACTTTAT TCCCTATAAC 
TT ATGTACAG TTAATTCTAC TGTTTTQGCT 
AAAAAAAAAA AAGGCa«3GCC GCTCTAGAGG 



AGAGAGCAAG AAAATGTGTC ACATCTATAC 
TGTATGCAAC TTTAATTTTG TTTAACACTA 
1TAAAATGTG TATATATATA TAATAGITTA 
GCAATAAAAT CGATTTTGAA ATAAAWRAAA 
ANCCAAG 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
TGCAGGATTT GGCCAGGACC CSCCGCGGTG GCGGTTGCTA TCGCTTCGCA GAACCTACTC 
AQGCAGCCAG CTGAGAAGAG TTGAGGGAAA GTGCTGCTGC TGGGTCTGCA GACGCGATGG 
ATAACGTGCA GCCGAAAATA AAACATCGCC CCTTCTGCTT CAGTGTGAAA GGCCACGTGA 
AGATGCTGCG GCTGGATATT ATCAACTCAC TGGTAACAAC AGTATTCATG CTCATCGTAT 
CTGTGTTGGC ACTGATACCA GAAACCACAA CATTGACAGT TGGTGGAGGG GTGTTTGCAC 
TTGTGACAGC AGTATGCTGT CTTGCCGACG GGGCCCTTAT TTACCGGAAG CTTCTGTTCA 
ATCCCAGCGG TCCTTACCAG AAAAAGCCTa TGCATGAAAA AAAAGAAGTT TTGTAATTTT 
ATATTACTTT TTAGTTTGAT ACTAAGTATT AAACATATTT CTGKATTATT CCAAAAAAAA 
AAAAAAAAAA AAAAAAAATT TQGTGG 



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 bctse pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 247: 
GTCTTTCTTT TNCTGTTTTG AGTTGGTGAG TGAGTGAATA GGGTAACATG GGCCTTCAGG 
ATGACCCCTT GGAACTGTGC CGAGTTCCTT AAATCTCAGC TGGGATCCTG GACCTGGGAG 



GCCCCTGTGA GGGCCAGCTC TGGAAAAACC TGGGAGTTGA TGCCGGAGGY TGGGAAGAAC 
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TCTGCTCGAG GGCAGGGTGC CCTGGAACAC TGGTAGTTCT GGGGCTGGGA GGGM3AGGGG 240 

CTCCGGCTTT CTCTCAAATG AACACTQCTC TTCAGCAGTT CAAGTACTTG TTCTCAAAAC 300 

A1TTTCTAAT TGATTGGTAG GTOTCATAA GCATTGTTrC TTTAAGGCAT GGAAAGGGAA 360 

GAATGCTCAA GCAAGTCATG TTTGTTTTCA GTGGGATGGG CCCX3CGTTCT CACTGCTGGG 420 

GGCTTCCCCT TGCATGTGGC ACCTTTC5TGC AGGGCCACCA GGCAGACTCT TCCCACCTTC 480 

TCCCACTGAA GCACCAAGGG GCTTGAACCG TAATTTGGCT AATCAGAGGC ATTTTTTTTG 540 

TCCTAGTATC TTTCACACTT GTCCAACCGT CTTATTTTTT TAAAAGTTCT GTTGCTTCTA 600 

TTAACACGAA ACTAGAGAGA AATAGTTTCT GAAGCCAGTT TATTGTGAAG ATCCCCAAGG 660 

GGAGGTTCGG TAGAGAAAAA TAGTAAQCTG GTTTAGAAAC TGACGAGGGC AAACAGCCAG 720 

GACGCATTGG AGAGGAATTT GCCAAAGATC TACCOTGAGA TAACGCCTGT CC7W3TGTCTT 780 

CACCACGTGA ATAACCAGCG CTCCAAAGTG TTTTTCTGCT TTGAAAAAAA AAATTCCACA 840 

AGCTTTTAAA GGTGCATTTA AGAATOCATG TGACTTTAGA ATGGAACTGC CGGCX:CTGGC 900 

AACTGTCACG TGTGCTAGAA GGTTCGATGC CTCTGGAATG CATGTGATAC TCATCTCCAT 960 

TTTGTTTCCT TGATTGCATT T r TG Vr CVV V TAGCAGATCT GTCCCTGTGG GTGGTGTCTA 1020 

AGAAGTCGGA CACCTTQGTT TTTGTGTTAG ATTGAGCTGG GCAGCTGCAA TCAGCTTCTT 1080 

TATATGCAAA TTAGGCACGA CCX:ATCTGTG GTTCCCTGGT TGGTGGCTAA TGAAGTGAGG 1140 

GGAGGGAGGG ATGTCACCCC AAAAGTAGGC CCTCCCATTG GCTTTGGCCA GGCCAGACAC 1200 

TTCACATCGT TTACATGGTT CTGTGTAATT TTAAAGTTTA TGTGTATAAA GCGAAGCTGT 1260 

TTCTGTGAAA CTGTATATTT TGTAAATAAA TATATTGCTA CTTTGAGAWR AAAAAAAAAA 1320 

AAAAACTCGA GGGGGGCCCG GTACCCAA 1348 



45 (2) I^FFORMATION FOR SEQ ID NO: 248: 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEfaSTESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
GTGCCGAATC GGCAGAGCGG CACGAGCGGC CACGAGAGCA GGCGGAGTAA AGGGACTTGA 
GCGAGCCAGT TGCCGGATTA TTCTATTTCC CCTCCCTCTC TCCCGCCCCG TATCTCTTTT 
CACCCTTCTC CCACCCTCGC TCGCGTASCA TGGCGGAGCG TCGGCGGCCA CTCAGTCCCA 



60 
120 
180 
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TTCCATCTCC TCGTCGTCCT TCGGAGCCGA GCCGTCCGCG CCXX3GCGGCG GCGGGAGCCX: 240 

AGGAGCCTGC CCCGCCCTGG GGACGAAGAG CTGCAGCTCC TCCTGTGCGG TGCACGATCT 300 

GATTTTCTGG AGAGATGTGA AGAAGACTGG GTTTGTCTTT GGCACCACGC TGATCATGCT 360 

GCTTTCCCTG GCAGCTTTCA GTGTCATCAG TGTGGTTTCT TACXTTCATCC TGGCTCTTCT 420 

CTCTGTCACC ATCAGCTTCA GGATCTACAA GTCCGTCATC CAAGCTGTAC AGAAGTCAGA 480 

AGAAGGCCAT CCATTCAAAG CCTACCTGGA CGTAGACATT ACTCTGTCCT CAGAAGCTTT 540 

CCATAATTAC ATGAATGCTG CXATGGTGCA CATCAACAGG GCCCTGAAAC TCATTATTCG 600 

TCTCTTTCTG GTAGAAGATC TGGTTGACTC CTTGAAGCTG GCTGTCTTCA TGTGGCTGAT 660 

GACCTATGTT GGTGCTGTTT TTAACGGAAT CACCXnTCTA ATTCTTGCTG AACTGCTCAT 720 

TTTCAGTGTC CCGATTGTCT ATGAGAAGTA CAAGACCCAG ATTGATCACT ATGTTGGCAT 780 

CGCCCGAGAT CAGACCAAGT CAATTGTTGA AAAGATCCAA GCAAAACTCC CTGGAATCGC 840 

CAAAAAAAAG GCAGAATAAG TACATGGAAA CCAGAAATGC AACAGTTACT AAAACACCAT 900 

TTAATAGTTA TAACXyTCGTT ACTTGTACTA TGAAGGAAAA TACTCAGTGT CAGCTTGAGC 960 

CTGCATTCCA AGCTTTTTTT TTAATTTGGT GTTTTCTCCC ATCCTTTCCC TTTAACCCTC 1020 

AGTATCAAGC ACAAAAATTG ATGGACTGAT AAAAGAACTA TCTTAGAACT CAGAAGAAGA 1080 

AAGAATCAAA TTCATAGGAT AAGTCAATAC CTTAATGGTG GTAGAGCCTT TACCTGTAGC 1140 

TTGAAAGGGG AAAGATTGGA GGTAAGAGAG AAAATGAAAG AACACCTCTG GGTCCTTCTG 1200 

TCCAGTTTTC AGCACTAGTC TTACTCAGCT ATCCATTATA GTTTTGCCCT TAAGAAGTCA 1260 

TGATTAACTT ATGAAAAAAT TATTTGGGGA CAGGAGTGT6 ATACCTTCCT TGGTTTTTTT 1320 

TTGCAGCCCT CAAATCCTAT CITCCTGCCC CACAATGTGA GCAGCTACCC CTGATACTCC 1380 

TTTTCTTTAA TGAITTAACT ATCAACOTGA ' TAAATAACTT ATAGGTGATA GTGATAATTC 1440 

CTGATTCCAA GAATGCCATC TGATAAAAAA GAATAGAAAT GGAAAGTGGG ACTGAGAGGG 1500 

AGTCAGCAGG CATGCTGCGG TGGCX3GTCAC TCCCTCTGCC ACTATCCCCA GGGAAGGAAA 1560 

RGCTCCGCCA TTTGGGAAAG TGGTTTCTAC GTCACTGGAC ACCGGTTCTG AGCATTAGTT 1620 

TGAGAACTCG TTCCCGAATG TGCTTTCCTC CXTTCTCCCCT GCCCACCTCA AGTTTAATAA 1680 

ATAAGGTTGT ACTTTTCTTA CTATAAAATA AAAAAAAAAA AACTCGAGGG GGGCCCXOTA 1740 

CCCAAATCGC CGGATATGAT CGTAAA 1766 



(2) INFORMATION FOR SEQ ID NO: 249: 
60 <i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTICM^: SEQ ID NO: 249: 

AGTGTCCTCG GAGCAGGCGG AGTAAAGGGA CTTGAGCGAG CXy^GTTGCCG GATTATTCTA 60 

10 TTTCCCCTCC CTCTCTCCCG CCCCX3TATCT CTTTTCACCC TTCTCCCACC CTCGCTCGCG 120 

TASCATGGCG GAGCX?rcGGC GGCCACTCAG TCCCATTCCA TCTCCTCGTC GTCCTTCGGA 180 

GCCGAGCCGT CCGCGCCCGG CGGCGGCGGG AGCCCAGGAG CCTGCCXXTGC CCTGGGGACG 240 

15 

AAGAGCTGCA GCTTCCTCCTG TGCGGTGCAC GATCTGATTT TCTGGAGAGA TGTGAAGAAG 300 

ACTGQGTTTG TCTTTGGCAC CACGCTGATC ATGCTGCTTT CCCTGGCAGC TTTCAGTGTC 360 

20 ATCAGTGrTGG TTTCTTACCT CATCCTQGCT CTTCTCTCTG TCACCATCAG CTTTCAGGATC 420 

TACAAGTCCG TCATCCAAGC TGTACAGAAG TCAGAAGAAG GCCATCCATT CAAAGCCTAC 480 

CTGGACGTAG ACATTACTCT GTCCTCAGAA GCTTTCCATA ATTACATGAA TGCTGCCATG 540 

25 

GTGCACATCA ACAGGGCCCT GAAACTCATT ATTCGTCTCT TTCTGGTAGA AGATCTGGTT 600 

GACTCCTTGA AGCTGGCTGT CTTCATGTGG CTGATGACCT ATGTTGGTGC TGTTTTTAAC 660 

30 GGAATCACCC TTCTAATTCT TGCTGAACTG CTCATTTTCA GTGTCCCGAT TGTCTATGAG 720 

AAGTACAAGA CCCAGATTGA TCACTATGTT GGCATCGCCC GAGATC7W3AC CAAGTCAATT 780 

GTTGAAAAGA TCCAAGCAAA ACTCCCTGGA ATCGCCAAAA AAAAGGCAGA ATAAGTACAT 840 

35 

GGAAACCAGA AATGCAACAG TTACTAAAAC ACCATTTAAT AGTTATAACG TCGTTACTTG 900 

TACTATGAAG GAAAATACTC AGTGTCAGCT TGAGCCTGCA TTCCAAGCTT TTTTTTTAAT 960 

40 TTGGTGTTTT CTCCCATCCT TTCCCTTTAA CCCTCAGTAT CAAGCACAAA AATTGATGGA 1020 

CTGATAAAAG AACTATCTTA GAACTCAGAA GAAGAAAGAA TCAAATTCAT AGGATAAGTC 1080 

AATACCTTAA TGGTGGTAGA GCCTTTACCT GTAGCTTGAA AGGGGAAAGA TTGGAGGTAA 1140 

45 

GAGAGAAAAT GAAAGAACAC CTCTGGGTCC TTCTGTCCTtfS TTTTCAGCAC TAGTCTTACT 1200 

CAGCTATCCA TTATAGTTTT GCCCTTAAGA AGTCATGATT AACTTATGAA AAAATTATTT 1260 

50 GGGGACAGGA GTGTGATACC TTCCTTGGTT TTTTTTTGCA GCCCTCAAAT CCTATCTTCC 1320 

TGCXrCCACAA TGTGAGCAGC TACCCCTGAT ACTCCTTTTC TTTAATGATT TAACTATCAA 1380 

CTTGATAAAT AACTTATAGG TGATAGTGAT AATTCCTGAT TCCAAGAATG CCATCTGATA 1440 

55 

AAAAAGAATA GAAATGGAAA GTGGGACTGA GAGGGAGTCA GCAGGCATGC TGCGGTGGCG 1500 

GTCACTCCCT CTGCCACTAT CCCCAGGGAA GGAAARGCTC CGCCATTTGG GAAAGTQGTT 1560 

60 TCTAOSTCAC TGGACACCGG TTCTGAGCAT TAGTTTGAGA ACTCGTTCCC GAATGTGCTT 1620 
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TCCTCCCTCT CCCXTOCCCA CXrrCAAGTTT AATAAATAAG GTTGTACTrT TCTTACTATA 1680 

AAATAAATCT CTGTAACTGC TGTGCACTGC TGTAAACTTG TTAGAGAAAA AAATAACCTG 1740 

CATGTGGGCT CCTCAGTTAT TGAGTTTTTG TGATCCTATC TCAGTCTGGG GGGGAACATT 1800 

CTCAAGAGGT GAAATACAGA AAGCCTTTTT TTCTTGATCT 1TTCCCGAGA TTCAAATCTC 1860 

CX3ATTCCCAT TTCGGGGCAA GTTTTTrrCT TCACCTTCAA TATGAGAATT CAGCGAACTT 1920 

GAAAGAAAAA TCATCTGTGA GTTCCTTCAG GTTCTCACTC ATAGTCATGA TCCTTCAGAG 1900 

GGAATATGCA CTGGCGAGTT TAAAGTAAGG GCTATGATAT TTGATGGTCC CAAAGTACGG 2040 

CAGCTGCAAA AAGTAGTGGA AGGAAATTGT CTACGTGTCT TGGAAAAATT AGTTAGGAAT 2100 

TTGGATGGGT ATU^AGGTACC CTTGCCTTAC TCCATCTTAT TTTCTTAGCC CXrCTTTGAGT 2160 

GTTTTAACTG GTTTCATGTC CTAGTAGGAA GTgCATTCTC CATCCTCATC CTCTCCCCTC 2220 

CCAGGAAGTC AGTGATTGTC TTTTTGGGCT TCCCCTCCAA AGGACCTTCT GCAGTGGAAG 2280 

TGCCACATCC AGTTCTTTTC TTTTGTTGCT GCTGTOTTTA GATAATTGAA GAGATCTTTG 2340 

TGCCACACAG GATTTTTTTT TITTTTAAGA AAAACCTATA GATGAAAAAT TACTAATGAA 2400 

ACTGTGTGTA CGTGTCTGTG CX3TGCAACAT AAAAATACAG TAGCACCTAA GGAGCTTGAA 2460 

TCTTGGTTCC TGTAAAATTT CAAATTGATG TGGTATTAAT AAAAAAAAAA AAT^ACAMAAA 2520 

AAAAAAAAAA AAAAGGGCGG CCX5CTCTAGA GGATCCAAGC TTACGTACGC GTGCATGCGA 2580 

CGTCCATAGC TCTTTCTATA GGGGTCCCCC AAATTCCATT CANCGGGCCG TCGGTTTTAN 2640 

AAAGGTCGTG AOTGGGGGAA ANCC 2664 
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(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
CGTGGGAGTG AGGTACCAGA TTCAGCCCAT TTGGCCCCGA CGCCTCTKTT CTCGGAATCC 
GGGTGCTGCG GATTGAGGTC CCGGTTCCTA ACGGTGGGAT CGGTGTCCTC GGGATGAGAT 
TTGGCGTTTC CTCGGGGCTT TGGTQGGATC GGTGTCCTCA GGATGAGATT TAGGGTTTCC 
TCGGGGCTTT CGGGATCTTC ACCTAATATC CGGACTGCAA GATGGAGGAA GGCGGGAACC 
TAGGAGGCCT GATTAARATG GTCCATCTAC TGGTCTTGTC AGGTGCCTGG GGCATGCAAA 



60 
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TGTGGGTGAC CTTCGTCTCA GGCTTCCTGC 
GACTAGTGCA GAGCAAACTC TTCCCCTTCT 
5 TCAACCTCTG CATCTTGGCT TCACAGCATG 
GCCAGCTTTA CCTGCTGTTC CTGAGCCTTA 
AACCCCGCAC CACAGCTGCC ATGTGGGCCC 

10 

GTCGGGAGGT ACCAGGCAGC CACCAGGCTTC 
ACCCCAAGTA CAC5TCCTCTC CGCXIAGAATT 
15 GCAATCTCGG CTGCGTCCTG AGCAATGGGC 
GGAGCCTCTA GCATGGGCCC TGCATGCTAA 
AAACTCGAGG GGGGCCCGGT ACCCA 

20 



TTTTCCX3AAG CXTTTCCCCGA CATACCTTCG 360 

ACTTCCACAT CTCCATGGGC TGTGCCTTCA 420 

CTTGGGCTCA GCTCACATTC TGGGAC5GCCA 480 

CGCTQGCCAC TGTCAACGCC CX3CTGGCTGG 540 

TGCAAACCGT GC5AGAAGGAG CGAGGCCTGG 600 

CCGATCCCTA CCGCXAGCTG CGAGAGAAGG 660 

TCTTCCGCTA CCATGGGCTG TCCTCTCTTT 720 

TCTGTCTCGC TGGCCTTGCC CTGGAAATAA 780 

TAAATGCTTC TTCAGAAAAA AAAAAAAAAA 840 

865 



25 



<2) INFORMATION FOR SEQ ID NO: 251: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 



35 



(xi) SBQUEbTCTE DESCRIPTION: SEQ ID NO: 251: 

TGGGGGGC5GN AATGQGTGTC TGGCTCANGG ATTGCCNAAT CTGGAAATTC TCCATAACTT 60 

GCTAGCTTGT TTTTTTTTTT TTTTTTTACA CCCCCCCGCC CCACCCCCOG ACTTCCACAA 120 

TGTTCAATGA TCTCAGCAGA GTTCTTCATG TGA7VACGTTG ATCACCTTTG AAGCCTGCAT 180 

40 CATTCACATA TTTTTTCTTC TTCTTCCCCT TCAGTTCATG AACTGGTGrT CATTTTCTGT 240 

GTGTGTGTGT GTTTTATTTT GTTTGGATTT TTTTTTrTAA TTTTACTTTT AGAGCTTGCT 300 

GTGTTGCCCA CCTTTTTrCC AACCTCCACC CTCACTCCTT CTCAACCCAT CTCTTCCGAG 360 

45 

ATGAAAGAAA AAAAAAAGCA AAGTTnTTT TTCTTCTCCT GAGTTCTTCA TGTGAGATTG 420 

AGCTTGCAAA GGAAAAAAAA ATGTGAAATG TTATAGACTT GCAGCGTGCC GAGTTCCATC 480 

50 GGGTTTTTTT TTTAGCATTG TTATGCTAAA ATAGAGAAAA AAATGCTCAT GAACCTTCCA 540 

CAATCAAGCC TGCATCAACC TTCTGGGTGT GACTTGTGAG TTTTGGCCTT GTGATGCCAA 600 

ATCTGAGAGT TTAGTCTGCC ATTAAAAAAA CTCATTCTCA TCTCATGCAT TATTATGCTT 660 

55 

GCTACTTTGT CTTAGCAACA ATGAACTATA ACTGTTTCAA AGACTTTATG GAAAAGAGAC 720 

ATTATATTAA TAAAAAAAAA AAGCCTGCAT GCTGGACATG TATGGTATAA TTATTTTTTC 780 

60 CTTTTTTTTT CCTTTTGGCT TGGAAATGGA CGTTCGAAGA CTTATAGCAT GGCATTCATA 840 
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CTTTTGTTTT ATTGCCTCAT GACTTTTTTG AGTTTAGAAC AAAACAGTGC AACCGTAGAG 900 

CCTTCTTCCC ATGAAATTTT GCATCTGCTC CAAAACTGCT TTGAGTTACT CAGAACTTCA 960 

5 

ACCTCCCAAT GCACTGAAGG CATTCCTTGT GCAAAGATAC CAGAATOGGT TACACATTTA 1020 

ACCTGGCAAA CATTGAAGAA CTCTTRATGT TTTCTTTTTA ATAAGAATGA CGCCCCACTT 1080 

10 TGGGGACTAA AATTGTGCTA TTGCCGAGAA GCAGTCTAAA ATTTATTTTT TAAAAAGAGA 1X40 

AACTGCCXXIA TTATTTTTGG TTrGTTTTAT TTTTATTTTA TATTTTTTGG CTTTTGGTCA 1200 

TTGTCAAATG TGGAATGCTC TGGGTTTCTA GTATATAATT TAATTCTAGT TTTTATAATC 1260 

15 

TOTTAGCCCA GTTAAAATGT ATGCTACAGA TAAAGGAATG TTATAGATAA ATTTGAAAGA 1320 

GTTAGGTCTG TITAGCTGTA GATTTTTTAA ACGATTGATG CACTAAATTG TTTACTATTG 1380 

20 TGATGTTAAG GGGGGTAGAG TTTGCAAGGG GACTTGrTTAA AAAAAGTAGC TTATACAGCA 1440 

TGTGCTTGCA ACTTAAATAT AAC3TTGGGTA TGTGTAGTCT TTGCTATACC ACTGACTGTA 1500 

TTGAAAACCA AAGTATTAAG AGGGGA7UVCG CCCCTGTTTA TATCTGTAGG GGTATTTTAC 1560 

25 

ATTCAAAAAT GTATGTTTTT TTTTCTTTTC AAAATTAAAG TATTTGGGAC TGAATTGCAC 1620 

TAAGATATAA CCTGCAAGCA TATAATACAA AAAAAAATTG CAAAACTGTT TAGAACGCTA 1680 

30 ATAAAATTTA TGCAGTTATA AAAATGGCAT TACTGCACAG TTTTAAGATG ATGCAGATTT 1740 

TTTTACAGTT GTATTGTGGT GCAGAACTGG ATTTTCTGTA ACTTAAAAAA AAATCCACAG 1800 

TTTTAAAGGC AATAATCAGT AAATGTTATT TTCAGGGACT GACATCCTGT CTTTAAAAAG 1860 

35 

AAATGAAAAG TAAATCTTAC CACAATAAAT ATAAAAAAAT CTTGTCAGTT ACTTTrCnT 1920 

TACATA1TTT GCTGTCCAAA ATTGTTTTAT ATCTTGAGTT ACTAACTAAC CACGCGTGTT 1980 

40 GTTCCTATCT GCTTTTCTTT CATTTTCAAT TCTGGTTATA TCAAGAAAAG AATAATCTAC 2040 

AATAATAAAC GGCATTTTTT TTTGAAAAAA AAAAAAAAAA AA 2082 



45 

(2) INFORMATIC3N FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 

CAGGCAGGCT GGCCCCGGGG ACTTCTCTCT GGCCCTGCTC CCTCCGAGCG CTCCGCCGTT 60 
GCCCGCCTGG CCCCTACGGA GTCCTTAGCC AGGATGGAGG CTGTTGTGAA CTTGTACCAA 120 

60 
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GAGGTGATGA AGCACGCAGA TCCCCGGATC CAGGGCTACC CTCTGATGGG GTCCCCCTTG 180 

CTAATGACCT CCATTCTCCT GACCTAOGTG TACTTCGTTC TCTCACTTGG GCCTCGCATC 240 

5 ATGGCTAATC GGAAC3CCCTT CCAGCTCCXST GGCTTCATGA TTGTCTACAA CTTCTCACTG 300 

GTCGCACTCT CCCTCTACAT TGTCTATGAG TTCCTGATGT CGGGCTGGCT GAGCACCTAT 360 

ACCTQGCX3CT GTGACCCTGT GGACTATTCC AACAGCCCTG AGGCACTTAG GATGGTTCGG 420 

10 

GTGGCCTGGC TCTTCCTCTT CTCCAAGTTC ATTGAGCTGA TGGACACAGT GATCTTTATT 480 

CTCCGAAAGA AAGACGGGCA GGTGACCTTC CTACATGTCT TCCATCACTC TGTGCTTCCC 540 

15 TGGAGCTGGT GGTGGGGGGT AAAGATTCCC CCGGGAGGAA TGGGCTCTTT CCATGCCATG 600 

ATAAACTCTT CCGTGCATGT CATAATGTAC CTGTACTACG GATTATCTGC CTTTGGCCCT 660 

GTGGCACAAC CCTACCTTTG GTGGAAAAAG CACATGACAG CCATTCAGCT GATCCAGTTT 720 

20 

GTCCTGGTCT CACTGCACAT CTCCCAGTAC TACTTTATGT CCAGCTGTAA CTACCAGTAC 780 

CCAGTCATTA TTCACCTCAT CTGGATGTAT GGCACCATCT TCTTCATGCT GTTCTCCAAC 840 

25 TTCTGGTATC ACTCTTATAC CAAGGGCAAG CGGCTGCTCC GTGCACTTCA GCAAAATGGA 900 

GCTCCAQGTA TTGCCAAGGT CAAGGCCAAC TGAGAAGCAT GGCCTAGATA GGCGCCCACC 960 

TAAGTGCCTC AGGACTGCAC CTTAGGGCAG TGTCCGTCAG TGCCCTCTCC ACCTACACCT 1020 

30 

GTGACCAAGG CTTATGTGGT CAGGACTGAG CAGGGGACTG GCCCTCCCCT CCCXACAGCT 1080 

GCTCTACAGG GACCACGGCT TTGGTTCCTC ACCCACTTCC CCCGGGCAGC TCCAGGGATG 1140 

35 TGGCCTCATT GCTGTCTGCC ACTCCAGAGC TGGGGGCTAA AAGGGCTGTA CAGTTATTTC 1200 

CCCCTCCCTG CCTTAAAACT TGGGAGAGGA GCACTCAGGG CTGGCCCCAC AAAGGGTCTC 1260 

GTGGCCTTTT TCCTCACACA GAAGAGGTCA GCAATAATGT CACTGTGGAC CCAGTCTCAC 1320 

40 

TCCTCCACCC CACACACTGA AGCAGTAGCT TCTGGGCCAA AGGTCAGGGT GGGCGGGGGC 1380 

CTGGGAATAC AGCCTGTGGA GGCTGCITAC TCAACTTGTG TCTTAATTAA AAGTGACAGA 1440 

45 GGAAACCAAA AAAAAAAAAA AAAAACTCGA GGGGGGCCCG TA 1482 

50 (2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

60 GGCACGAGCG CCGTTGCCCG CCTGGCCCCT ACGGAGTCCT TAGCCAGGAT GGAGGCTGTT 60 
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GTGAACTTGT ACCAAGAGGT GATGAAGCAC GCAGATCCCC GGATCCAGGG CTACCCTCTC 
ATGGGGTCCC CXTTTGCTAAT GACCTCCATT CTCCTGACCT ACGTGTACTT CGTTCTCTCA 
CTTGGGCCTC GCATCATGGC TAATCGGAAG CXTCTTCCAGC TCXX3TGGCTT CATGATTOIC 
TACAACTTCT CACTGGTGGC ACTCTCCCTC TACATTGTCT ATGAGTTCCT GATGTCGGGC 
TGGCTGAGCA CCTATACCTG GCGCTGTGAC CCTCAGGACT GCACCTTAGG GCAGTGTCCG 
TCAGTCCCXrr CTCCAMCTAC ACCTGTGACC AAGGCTTATG TGGTCAGGAC TGAGCAGGGG 
ACTGGCCCTC CCCTCCCCAC AGCTGCTCTA CAGQGACCAC GGCTTOGGTT CCTCACCCAC 
TTCCCCCGGG CAGCTCCAGG GATGTGGCCT CATTGCTGTC TGCCACTCCA GAGCTCGGGG 
CTAAAAGGGC TGTACAGTTA TTTCCCCCTC CXTTGCCTTAA AACTTGGGAG AGGAGCACTC 
AGGGCTGGCC CCACAAAGGG TCTCGTGGCC OTTTTCCTGA CACAGAAGAG GTCAGCAATA 
ATGTCACTGT GGACCCAGTC TCACTCCTCC ACCCCACACA CTGAAGCAGT AGCTTCTCGG 
CCAAAGGTCA GGGTGGGCGG GGGCCTGGGA ATACAGCCTG TGGAGGCTQC TTACTCAACT 
TGTGTCTTAA TTAAAAGTGA CAGAGGAAAC CACX3AAAAAA AAAAAAAAAA AAAA 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
- (D)~l<>PoriOGyT Ixheaf " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

TTGAACTTTT AAAATTTTAG ATCAGCAAAC TCTAAGATCC TAGAATGGAA QCTGTTCCTC 

ATTTCTCCAT GCTCACCCTC CCAGGTCAGC GAGATGGTGA AGAAGCTGCA CGCGGCAACA 

CCACCAACGT TCGGAGTGGA CCTCATCAAT GAGCTTGTGG AGAACTTTGG CAGATGTCCC 

AAGTGGTCTG GTCGGCAAGC CTTTGTCTTT GTCTGCCAGA CTGTCATTGA GGATGACTCC 

CTTCCCATGG ACCAGTTTGC TGTGCATCTC ATGCCGCATC TGCTAACCTT AGCAAATGAC 

AGGGTTCCTA ACGTGCGAGT GCTGCTTGCA AAGACATTAA GACAAACTCT ACTAGAAAAA 

GACTATTTCT TQGCCTCTGC CAGCTGCCAC CAGGAGGCTG TGGAGCAGAC CATCATGGCT 

CTTCAGATGG ACCGTGACAG CGATGTCAAG TATTTTGCAA GCATCCACCC TGCCAGTACC 

AAAATCTCCG AAGATGCCAT GAGCACAGCG TCCTCAACCT ACTAGAAGGC TTGAATCTCG 

GTGTCTTTCC TGCTTCCATG AGAGCCGAGG TTCAGTGGGC ATTCGCCACG CATGTCACCT 
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GGGATAGCTT TCGGGGGAGG AC3AGACXTTC CTCTCCTGCG GACTTCATTG CAGGTGCAAG 660 

TTGCOTACAC CXAATACCAG 03ATTTCAAG AGTCAAGAGA AAGTACAGTA AACACTATTA 720 

TCTTATCTTG ACTTTAAGGG GAAATAATTT CTCAGAGGAT TATAATTGTC ACCGAAGCCT 780 

TAAATCCTTC TGTCTTCCTG ACTGAATGAA ACTTGAATTG GCAGAGCATT TTCCTTATGG 840 

AAGGGATGAG ATTCCCAGAG ACCTGCATTG CTTTCTCCTG GTTTTATTTA ACAATCGACA 900 

AATGAAATTC TTACAGCCTG AAGGCAGACG TGTGCCCAGA TGTGAAAGAG ACCTTCAGTA 960 

TCAGCCCTAA CTCTTCTCTC CCAGGAAGGA CTTGCTOGGC TCTGTCGCCA GCTCTCCAGC 1020 

15 CXZAGCCCTGT GTGTGAATCG TTTGTGACGT GTGCAAATGG GAAAGGAGGG GTTTTTACAT 1080 

CTCCTAAAGG ACCTGATGCC AACACAAGTA GGATTGACTT AAACTCTTAA GCGCAGCATA 1140 

TTGCTGTACA CATTTACAGA ATGGTTGCTG AGTGTCTGTG TCTGATITTT TCATGCTGGT 1200 

20 

CATGACCTGA AGGAAATTTA TTAGACGTAT AATGTATGTC TGGTGTTTTT AACTTGATCA 1260 

TGATCAGCTC TGAGGTGCAA CTTCTTCACA TACTGTACAT ACCTGTGACC ACTCTTGGGA 1320 

25 GTGCTGCAGT CTTTAATCAT GCTGTTTAAA CTGTTGTGGC ACAAGTTCTC rTGTCCAAAT 1380 

AAAATTTATT AATAAGATCT ATAGAGAGAG ATATATACAC TTTTGATTGr TTTCTAGATG 1440 

TCTACCAATA AATGCAATTT GTGACCTGTA TTAAAAAAAA NTAAAAAAAC TCGAGGGGGG 1500 

CCCGGTAC 1508 



30 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 255: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2514 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

GAGAGACTCA CACTTCTTTT CCATTATCAC TGACGATGTA. GTGGACATAG CAGGGGAAGA 60 

GCACCTACCT GTGTTGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 

50 AGGCTTCCTG CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 180 

AACTGAGAAG TGGGGATTAA ATATGGAGTA TTGTCGTGGC CAGGCTTACA TTGWIirrCTAG 240 

TGGATTTTCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TTAGAGAAAT ATCCCCAAGC 300 

TATCTACACA CTCTGCTCTT CCTGTGCCTT AAATATGTGG TTGGCAAAAT CAGTACCTGT 360 

TATGGGAGTA TCTGTTGCAT TAGGAACAAT TGAGGAAGTT TGTTCTTTTT TCCATCGATC 420 

60 ACCACAACTG CTTTTAGAAC TTGACAACGT AATTTCTGTT CTTTTTCAGA ACAGTAAAGA 480 
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AAGGGGTAAA GAACTGAAGG AAATCTGCCA TTCTCAGTGG ACAGGCAGGC ATGATGCTTT 540 

TGAAATTTTA GTGGAACTCC TGCAAGCACT TGTTTTATGT TTAGATGGTA TAAATAGTGA 600 

5 

CACAAATATT AGATGGAATA ACTATATAGC TGGCCGAGCA TTTGTACTCT GCAGTGCAGT 660 

GTCAGATrTT GATTTCATTG TTACTATTGT TGTTCTTAAA AATGTCCTAT CTTTTACAAG 720 

10 AGCCTTTGGG AAAAACCTCC AGGGGCAAAC CTCTGATGTC TTCTTTGCGG CCGGTAGCTT 780 

GACTGCAGTA CTGCATTCAC TCAACGAAGT GATTGGAAAA TATTGAAGTT TATCATGAAT 840 

TTTGGTTTGA GGAAGCCACA AATTTGGCAA CCAAACTTGA TATTCAAATG AAACTCCXTTG 900 

15 

GGAAATTCCG CAGAGCTCAC CAGGGTAACT TGGAATCTCA GCTAACCTCT GAGAGTTACT 960 

ATAAAGAAAC CCTAAGTGTC CCAACAGTGG AGCACATTAT TCAGGAACTT AAAGATATAT 1020 

20 TCTCAGAACA GCACCTCAAA GCTCTTAAAT GCTTATCTCT GGTACCCTCA GTCATGGGAC 1080 

AACTCAAATT CAATACGTCG GAGGAACACC ATGCTGACAT GTATAGAAGT GACTTACCCA 1140 

ATCCTGACAC GCTGTCAGCT GAGCTTCATT GTTGGAGAAT CAAATGGAAA CACAGGGGGA 1200 

25 

AAGATATAGA GCTTCCGTCC ACCATCTATG AAGCCCTCCA CCTGCCTGAC ATCAAGITTT 1260 

TTCCTAATGT GTATGCATTG CTGAAGGTCC TGTGTATTCT TCCTGTGATG AAGGTTGAGA 1320 

30 ATCAGCX3GTA TGAAAATGGA CGAAAGCGTC TTAAAGCATA TTTGAGGAAC ACTTTGACAG 1380 

ACCAAAQGTC AAGTAACTTG GCTTTGCTTA ACATAAATTT TGATATAAAA CACGACCTGG 1440 

ATTTAATGGT GGACACATAT ATTAAACTCT ATACAAGTAA GTCAGAGCTT CCTACAGATA 1500 

35 

ATTCCGAAAC TGTGGAAAAT ACCTAAGAGA CTTTTAAAAA TAGGCTTTCT TATATTTGAT 1560 

ATTTGGAAGA AAAAGCCGTA AGTGTATGTA GACCACTTAA TCACTAAATA TCTTTGCCTA 1620 

40 TAGGACTCCA TTGAATACAT TAGCCATTGA TAATCTACXTT GTTTAAATGG CCCCTGTTTG 1680 

AACTCTCAAG CTTTGAAGAC CTACCTGTTC TTCCAGAAGA GAACGTTGAA AGTGCCATGT 1740 

TTCCTTTTGC GTGATCTCTG TTGATGGCAC TCTGGAATTG TTTCAGTTAA GTCATTTTAG 1800 

45 

ACATAGCATT TATTATCACT GTGGATCTCT ACTTGTTGGG TGTTATGAAT TCTTTGAAGA 1860 

AATATATTTT GAAGAGGTGT GGGAGGAAGG AATACATTTT ATAAAATGTT GTAGTGAAGC 1920 

50 CCACAATTGA CCTTTGACTA ATAGGAGTTT TAAGTATGTT AAAAATCTAT ACTQGACAGT 1980 

TACAAGAAAT TACCGGAGAA AAGCTTGTGA GCTCACCAAA CAAGGATTTC AGTGTAGATT 2040 

TTGTCTTTCT TGAACTTAAA GAAACAAATG ACAAAGTTTG AATGGAAAAG CCTGCTGTTG 2100 

55 

TTCCACATCT CGTTGCTGTT TACATTCCTT TGTGGAGCCT ACATCTTCCT AAGCTTTTTA 2160 

GCAGGTATAT GTTGAACACT TCTGTTTCAT GGTTGAGACA GAATCAGAGG CCATGGATAC 2220 

60 TGACAACTGA TTTGTCTCTT TTTTTTCTCT GTCTTTTTCX: ATGACTCTTA TATACTGCCT 2280 
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CATCTTGATT TATAAGCAAA ACCTGGAAAA CCTACAAAAT AAGTGTTCTG GTTTATCTAG 2340 

AAAAATATGG AAAATATTGC TGTTATTTTT GGTGAAGAAA ATCAATTTTG TATAGriTAT 2400 

TTCAATCTAA ATAAAATGTG AATTTTGTTT AAAGCTTAGG CACATTATTT TTTGTGGGGT 2460 

CAAAACATTC TTC3TGTAAAT TCTCTTAAAC ATTTGATAAA CAGCTTCACA ATTC 2514 

(2) INFORMATION FOR SEQ ID NO: 256: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIjOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 256: 
CTGCCTTATG AAGCCGATGC AGAAATTTTG GCTGTGAAAT TTCACACTAT GATAACTGAG 60 
25 AAGTGGGGAT TAAATATGGA GTATTGTCGT GGCCAGGCTT ACATTGTCTC TAGTGGATTT 120 
TCTTCCAAAA TGAAAGTTGT TGCTTCTAGA CTTTTAGAGA AATATCCCCA AGCTATCTAC 180 
ACACTCTGCT CTTCCTGTGC CTTAAATATG TGGTTGGCAA AATCAGTACC TGTTATGGGA 240 

30 

GTATCTGTTG CATTAGGAAC AATTGAGGAA GTTTGTTCTT TTTTCCATCG ATCACCACAA 300 
CTGCTTTTAG AACTTGACAA CGTAATTYCT &TTCTTTTTC AGAACAGT AA AGAAAGGGGT 360 
35 AAAGAACTGA AGGAAATCTG CCATTCTCAG TGGACAGGCA GGCATGATGC TTTTGAAATT 420 
TTAGTGGAAC TCCTOCAAGC ACrTGTTTTA TCTTTAGATG GTATAAATAG TGACACft^ _ .480. 

ATTAGATGGA ATAACTATAT AGCTGGCCGA GCATTTGTAC TCTGCAGTGC AGTGTCAGAT 540 

40 

TTTGATTTCA TTGTTACTAT TGTTGTTCTT AAAAATGTCC TATCTTTTAC AAGAGCCTTT 600 
GGGAAAAACC TCCAGGGGCA AACCTCTGAT GTCTTCTTTG CGGCCGGTAG CTTGACTGCA 660 
45 GTACTGCATT CACTCAACGA AGTGANTGGA AAATATTGAA GTTTATCATG AATTTTGGTT 720 
TGAGGAAGCC ACAAATTTGG CAACCAAACT TGATATTCAA ATGAAACTCC CTGGGAAATT 780 
CCGCAGAGCT CACCAGGGTA ACTTGGAATC TCAGCTAACC TCTGAGAGTT ACTATAAAGA 840 

50 

AACCCTAAGT GTCCCAACAG TGGAGCACAT TATTCAGGAA CTTAAAGATA TATTCTCAGA 900 
ACAGCACCTC AAAGCTCTTA AATGCTTATC TCTGGTACCC TCAGTCATGG GACAACTCAA 960 
55 ATTCAATACG TCGGAGGAAC ACCATGCTGA CATGTATAGA AGTGACTTAC CCAATCCTGA 1020 
CACGCTGTCA GCTGAGCTTC ATTGTTGGAG AATCAAATGG AAACACAGGG GGAAAGATAT 1080 
'AGAGCTTCCG TCCACCATCT ATGAAGCCCT CCACCTGCCT GACATCAAGT TTmCCTAA 1140 

60 
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TGTCTATGCA TTCCTGAAGG TCXTCTGTAT TCTTCCTGTG ATGAAGGTTG. AGAATGAGCG . 1200 

GTATGAAAAT GGACGAAAGC GTCTTAAAGC ATATTTGAGG AACACTTTGA CAGACCAAAG 1260 

5 GTCAAGTAAC TTGGCTTTGC TTAACATAAA TTTTGATATA AAACACGACX: TGGATTTAAT 1320 

GGTGGACACA TATATTAAAC TCTATACAAG TAAGTCAGAG CTTCCTACAG ATAATTCCGA 1380 

AACTGTGGAA AATACCTAAG AGACmTAA AAATAGGCTT TCTTATATTT GATATTTGGA 1440 

10 

AGAAAAAGCC GTAAGTGTAT GTAGACCACT TAATCACTAA ATATCTTTGC CTATAGGACT 1500 

CCATTGAATA CATTAGCXIAT TGATAATCTA CCTGTTTAAA TGGCXrCCTGT TTGAACTCTC 1560 

15 AAGCTTTGAA GACCTACCTG TTCTTCCAGA AGAGAACGTT GAAAGTGCCA TGTTTCCTTT 1620 

TGCGTGATCT CTGTTGATGG CACTCTGGAA TTGOTTCAGT TAAGTCATTT TAGACATAGC 1680 

ATTTATTATC ACTGTGGATC TCTACTTGTT GGGTGTTATG AATTCTTTGA AGAAATATAT 1740 

20 

TTTGAAGAGG TGTGGGAGGA AGGAATACAT TTTATAAAAT GTTGTAGTGA AGCCCACAAT 1800 

TGACCTTTGA CTAATAGGAG TTTTAAGTAT GrTAAAAATC TATACTGGAC AGTTACAAGA 1860 

25 AATTACCGGA GAAAAGCTTG TGAQCTCACC AAACAAGGAT TTCAGTGTAG ATTTTGTCIT 1920 

TCTTGAACTT AAAGAAACAA ATGACAAAGT TTGAATGGAA AAGCCTGCTG TTGTTCCACA 1980 

TCTCGTTGCT GTTTACATTC CTTTGTGGAG CCTACATCTT CCTAAGCTTT TTAGCAGGTA 2040 

30 

TATGTTGAAC ACTTCTGTTT CATQGTTGAG ACAGAATCAG AGGCCATGGA TACTGACAAC 2100 

TGATTTGTCT GTTTTTTTTC TCTGTCTTTT TCCATGACTC TTATATACTG CCTCATCTTG 2160 

35 ATTTATAAGC AAAACCTGGA AAACCTACAA AATAAGTGTT GTGGTTTATC TAGAAAAATA 2220 

OXSGAAAATAT TGCTGTTATT TTTGGTGAAG AAAATCAATT TTGTATAGTT TATTTCAATC 2280 

TAAATAAAAT GTGAATTTTG TOTAAAGCTT AGGCACATTA TTTTTTGTGG GGTCAAAACA 2340 

40 

TTCTTGTGTA AATTCTC 2357 



45 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I^ENGTH: 689 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEENESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

55 

ACTTTCTGGT GCAAAAAGAT GTTCAAGCCT TATTTTATAC TTGCCTGCCC CTTTCTCTTT 60 
CATTTATTGG AGTGAGCTGC AGCTCTAAGA AGACCTGTTC TTTTGAATGG AGAGTAGCAT 120 
60 CAGGAACCAG GATGTGGGTG CGAGGCGTGC TCCTGGCTGT TGCAGATTGC TGCACCCGGG ISO 
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AGCrCTTAGT GGACAGAGCT AGAGGATATG TGCACGTACT TCCATCTCTC TCTCTCTCTC 240 

CGATTTTAGC CCAGCACX:AC AGGGTACX3TT CCAGTTTTTC TCTCTTTCCA TAGCTCTAAG 300 

5 

GCCCTTTCTG GGAATGGTTC TCATTCTCCT TAATCTATTA TTGQGTCAGT TTTCCTGCAT 360 

GTCCCX7W3CX: TCCCATCACT GCCACCCACT CCCCACAGAG ATGCCCTGCT CATCXXACTG 420 

10 GGGCTTTGAC TCCCACACTG TGTACCCCTC TTGTGTQGAC GCCCTGCTGC CAAAACCTTC 480 

AGCAAACAGC TTTCCAAATG GAAGTTGTCA CTGTCARGGS CTTTACAATC AGCAACAGCA 540 

AAATCTACAT GCTGCTGAGG GTCCTGCCTC ATTAAGATGC AATAAATATG TAAGTACATA 600 

AAAACAGCAA TAGAAGAAAC GTAATGCTTT ATTCTCAAAT ATGNATGTCT ACATAGAAAA 660 

GCXIAAAATTA TTAAGAATAG TAAOGAATT 689 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 258: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
TCGACCCACG CGTCCGCCGA TGTGATGATT CCTGCGTATT CCAAGAACCG GGCCTATGCC 60 
35 ATCTTCTTCA TAGTCTTCAC TGTGATAGGG GACGCCCCCG GCGCTGTGCT ATCCTGTGCC 120 
GGCCACCCTT GCGITGCTTT TGCrcCTGTA CTGGTGGCGC CCCTGACCGT GGCTGTCTCC 180 
TCTTGAAGGA AGCCTGTTTC TGATGAACCT GCTGACAGCC ATCATCTACA GTCAGTTCCG 240 

40 

GGGCTACCTG ATGAAATCTC TCCAGACCTC GCTGTTTCGG AGGCGGGTGG GAACCCGGCT 300 
GCCTTTGAAG TCCTATCCTC CATGGTGGGG GAGGGAGGAG CCTTCCCTCA GGCAGTTGGG 360 
45 GTGAAGCCCC AGAACTTGCT GCAGGTGCTT CAGAAGGTCC AGCTGGACAG CTCCCACAGA 420 
CAGGCCATGA TGGAGAAGGT GCGTTCCTAT GGCAGTGTTC TGCTCTCAGC TGAGGAGTTT 480 
CAGAAGCTCT TCAACGAGCT TGACAGAAGT GTGGTTAAAG AGCACCCGCC GAGGCCCGAG 540 

50 

TACCAGTCTC CGTTTCTGCA GAGCGNCCCA GTTCCTCTTC GGCCACTNAC TACTTTGACT 600 
ACCTGGGGAA CCTCATCGCC CTGGCAAACC TGGTGTCCAT TTGCGTGTTC CTGGTGCTGG 660 
55 ATGCAGATGT TGCTGCCTGC TGAGCGTGAT GACTTCATCC TGGGGGGTCT CAACTGCGTC 720 
TTCATTGTGT ACTACCTGTT GGAGATGCTG GCTCAAGGTC TTTTGCCCTG GGGCCTGCGA 780 
RGGTACYKKT CCTAACCCCA RCAAMGTGTT TTGAACGGGC TCCTCAMCGT TTGTCCTGGC 840 



60 
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TGGWWKKGSM GATCTCAACT CTGGCTGTGT ACCGATTGCC ACACCCAGGC TGGAGGCCGG 900 

ANATGGTGGG CCTGCTGTCG CTGTGGGACA TGACCCGCAT ACTGAACATG CTCATCGrGT 960 

5 TCCGCTTCCT GCGTATCATC CCCAGCATGA AGCCGATGGC CGTGGTOGCC AE3TACCGTCC 1020 

TGGGCCTGGT GCAAAACATG CGTGCGTTTG GCXSGGATCCT GGTGGTOGTC TACTACGTAT 1080 

TTGCCATCAT TGGGATCAAC TTGTTTAGAG GCGTCATTGT GGCTCTTCCT GGAAACAGCA 1140 

10 

GCCTGGCCCC TGCCAATAGG TCGGCGCCCT GTGGGAGCTT CGAGCAGCTG GAGTACTGGG 1200 

CCAACAACTT CGATGACTTT GCGGCTGCCC TGGTCACTCT GTGGAACTTG ATGGTGGTGA 1260 

15 ACAACTGGCA GGTGTTTCTG GATGCATATC GGCGCTACTA AGGCCCXyTGG TCCAAGATCT 1320 

ATTTTGTATT GTGGTGGCTG GTGTCGTCTG TCATCTGGGT CAACXrrGTTT CTGGCCCTGA 1380 

TTCTGGAGAA CTTCCTTCAC AAGTGGGACC CCCGCAGCCA CXTTGCAGCCC CTTGCTGGGA 1440 

20 

CCCCAGAGGC CACCTACCAG ATGACTGTGG AGCTCCTGTT CAGGGATATT CTGGAGGAGC 1500 

CCGGGGAGGA TGAGCTCACA GAGAGGCTGA GCCAGCACCC GCACCTGTGG CTGTGCAGGT 1560 

25 GACGTCCGGG TCTGCCATCC CAGCAGGGGC GGCAGGAGAG AGAGGCTGGC ATAACACAGG 1620 

TGCCCATCAT GGAAGAGGCX5 GCCATGCTGT GGCCAGCCAG GCAGGAAGAG ACCTTTCCTC 1680 

TGACGGACCA CTAAGCTGGG GACAGGAACC AAGTCCTTTG CGTGTGGCCC AACAACCATT 1740 

30 

TACAGAACAG CTGCTGGTGC TTCAGGGAGG CGCCGTGCCC TCCXXTTTTCT TTTATAGCTG 1800 

CTTCAGTGAG AATTCCCTTG TCGACTCCAC AGGGACCTTT CAGACAAAAA TGCAAGAAGC 1860 

35 AGCGGCCTCC CCTGTCCCCT GCAGCTTCCG TGGTGCCTTT GCTGCCQGCA GCCXTTTOGGG 1920 

AGCACAGGCC TGACCAGGGC CnSCACAGGT TAACCGTCAG ACTTCCGGGG CATTCJ^GCTG 1980 

GGAATGATAC TAATACCTCC GATTTTAGCC CAGCACCACA GGGTACGTTC CAGTTTTTAT 2040 

40 

TTCTTTCCAT AGCTCTAAGG CCCTTTCTGG GAATGGTTAT CATTCTCCTT AATCTATTAT 2100 

TGGGTCAGTT TTCCTGCATG TCCCCAGCCT CCCATCACTG CCACCCACTC CCCACAGAGA 2160 

45 TGCCCTGCTC ATCCX3ACTGG GGCTTTGACT CCCACACTGT GTACCX:CTCT TGTGTGGACG 2220 

CCCTGCTGCC AAAACCTTCA GCAAACAGCT TTCCAAATGG AAGTTGTGAC TGTCAGGGCC 2280 

TTTACAATCA GCAACAGCAA AATCTACATG CTGCTGAGGG TCCTGCCTCA TTAAGATGCA 2340 

50 

ATAAATATGT AAGTACATAA AAAAAAAAAA AAAAAAA 2377 



55 

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STOANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 259: 

5 

TCTGNTCGCC GTCGCCCCGC CCCTGGCCTT TGCCXX3GTCG GGCGGGACTT CCTGTCTCGT 60 
ATTTCCAAGG ACTCCAAAGC GAGGCCGGGG ACTGAAGGTG TGGGTGTCGA GCCCTCTGGC 120 



10 AGAGGGTTAA CCTGGGTCAA ATGCACGGAT TCTCACCTCG TACAGTTACX3 CTCTCCCX3CX3 180 



GCACXTTCCGC GAGGMYTTGA AGTCCTGAGC GCTCAAGTTT GTCCGTAGTC GAGAGAAGGC 240 



CATGGAGGTG CCGCCACCGG CACCGCGGAG CrTTCTCTGT AGAGCATTGT GCXTTATTTCC 300 

15 

CXX5AGTCTTT GCTGCCGAAG CTGTGACTGC CGATTCGGAA GTCCTTGAGG AGCGTCAGAA 360 



GCGGCTTCCC TACGTCCCAG AGCCCTATTA CCCGGAATCT GGATGGGACC GCCTCCGGGA 420 



20 GCTGTTTGGC AAAGACACAG TGAACACTAG TCTGAATGTA TACCGAAATA AAGATGCCTT 480 



AAGCCATTTT GTAATTGCAG GAGCTGTCAC GGGAAGTCTT TTTAGGATAA ACGTAGGCCT 540 



GCGTGGCTGG TGGCTGGTGG CATAATTGGA GCCTTGCTGG GCACTCCTGT AGGAGGCCTG 600 

25 

CTGATGGCAT TTCAGAAGTA CTCTGGTGAG AdCTTCAGG AAAGAAAACA GAAGGATCGA 660 



AAGGCACTCC ATGAGCTAAA ACTGGAAGAG TGGAAAGGCA GACTACAAGT TACTGAGCAC 720 



30 CTCCCTGAGA AAATTGAAAG TAGTTTACAG GAAGATGAAC CTGAGAATGA TGCTAAGAAA 780 
ATTGAAGCAC TGCTAAACCT TCCTAGAAAC CCTTCAGTAA TAGATAAACA AGACAAGGAC 840 



TGAAAGTGCT CTGAACTTGA AACTCACTGG AGAGCTGAAG GGAGCTGCCA TGTCCGATGA 900 

35 

ATGCCAACAG ACAGGCXyVCT CTTTGGTCAG CCTGCTGACA AATTTAAGTG CTGGTACCTG 960 



TGGTGGCAGT GGCTTGCTCT TGTCTTTTTC ' X ' l ' r i V lTrr r AACTAAGAAT GGGGCTGTTG 1020 



40 TACTCTCACT TTACTTATCC TTAAATTTAA ATACATACTT ATGTTTGTAT TAATCTATCA 1080 



ATATATGCAT ACATGAATAT ATCCACCCAC CTAGATTTTA AGCAGTAAAT AAAACATTTC 1140 



GCAAAAGATT AAAGTTGAAT TTTACAGTTA AAAAAAAAAA AAAAAAAAAA AAA 1193 

45 



50 



(2) INFORMATION FOR SEQ ID NO: 260: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
55 (D) TOPOUX^: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 
GAAAAACCCA AAGATGCAGA CAATCTCTTT GAACATGAAT TGGGGGCTCT CAATATGGCT 60 
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GCATTACTAC GAAAAGAAGA AAGAGCAAGT CTTCTTAGTA ATCTTGGCCC ATGTTGTAAG 120 

GCGTTCTGCT TCAGACGGGA TTCTGCAATT CGAAAGCAGC TTGTTAAAAA TGAGAAGGGC 180 

5 ACCATAAAAC AAGCTTACAC GAGTSCTCCA ATGGTAGACA ATGAATTACT TCGATTGAGT 240 

CTTCX3GTTAT TTAAGCGGAA GACTACTTGC CATGCTCCAG GACATGAAAA GACTGAAGAT 300 



AATAAACTTT CACAGTCCAG TATCCAACAG GAACTGTGTC TGTCTTAAGA CCGAAGTTCA 360 

10 

ATATGGTATT TrTGGTACTG TCTTCCTTCA GCAGTGCATA TTCTTTTGCA AAGTTCTTTG 420 

GTTTGACAAG CATTAGTGAC AAAGGCAGAA AAGATTTATC AGCCATGCTA AAAGAGTGAA 480 

15 GAATTTTGAT CTTTAGAGAC ACTAGTTTTG GCCAACTTAA GATTTTACGT TAATTTTTAC' 540 

ATAGTATTTG ACACTCATGC AAAATAATGT GAAAACATCT AGATTTAGTA GTTTATTCTG 600 

CGCCTTTrGT TAAAACTGAA GATTTTGGAA AATGGTTGTC ACTGCTCTTC CAGCCTATGA 660 

20 

ATATTTTTGT GAAATGGAAC CATGGATTTA TGTCTGGATC ATCCATACAG AACCAACAAT 720 

TTTATTCAAA AACAATGTGT TCATCAAAGT AATTGCTCAC ATTGTGCAGT ACTATGTTGT 780 

25 ACAGACCACG TGAAAGGGAA TGCTCGTCTA GCTGGCGTGG TATGTTTATA GGCGAATTTC 840 

AGCAGAAGGA AGCCAAAATA GTTTTTTCCT TTTGAAAGTT TTTTAAAAAT TATTTCATGG 900 

(j IKJ'ri'i''iTlU' TAATTAATAT GTGTGCATTG TTACAATGTA TGTTQGATGT CTTTTGACCX: 960 

30 

TAAATGCTTT TTTTGTTATC AGAGATTGTG TACTATTTTT ATTTTTAATA AATGTATCTT 1020 

CCCTTTCCTT GnTTAGATT TACTTTGCTC TTCGTTAATC TTATTCCTGA TGATCTAGAA 1080 

35 CATTAGTCAT CAACATTACA TGTTTCATGC 1TCAGATATT TTACTGCTTG TGTCCTTATT 1140 

GTTGGACAGC TTTAAACAGA GTTGATGGTA CTTCAAATAT AGCTCATTGA TACTTAAGGG 1200 

CANCTTCCTT GGGATGTGGG CTTTTTGGAA GGAAAAAAAT TNCCCCAAAG GCAAATCCCA 1260 

40 

GT 1262 



45 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1179 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPODOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

55 

GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 
GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 
60 CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180 
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CTGGACCTCA TCTTCCTGCG (XGACCTGCG CGGGGAAGGG GAGTTTCAGA CTGTGAAGGA 240 

CGTCGTGCTG GACTGCXTTCT TGGACTTCTT ACCCGAGGGG GTGAACAAAG AGAAGATCAC 300 

5 

ACCACTCACG CTCAAGGAAG CITATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 360 

CXX3ATGGAGT CTTATATCCC TGTCAAACAA CAGTGGCAAA AATGTGGAAC TGAAATTTGT 420 

10 GGATTCCCTC CGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 480 

TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCXIACAAT 540 

AATCGGGGAG AGCGTCTATG GCGATTTCCA GGAAGCXTTTT GATGACXTTTT GTAACAAGAT 600 

15 

CATTGCCACC AGGAACCCAG AGGAAATCCG AGGGGGAGGC CTGCTTAAGT ACTGCAACCT 660 

CTTGGTGAGG GGCTTTAGGC CCGCCTCTGA TGAAATCAAG ACCCTTCAAA GGTATATGTG 720 

20 TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TGGAGTCCTA 780 

•TITGCAGAAC CACTTTGTGG GATTGGAAGA CCGCAAGTAT GAGTATCTCA TGACCCTTCA 840 

TGGAGTGGTA AATGAGAGCA CAGTGTGCCT GATGGGACAT GAAAGAAGAC AGACTTTAAA 900 

CCTTATCACC ATGCTGGCTA TCCGGGTGTT AGCTGACX:AA AATGTCATTC CTAATGTGGC 960 

TAATGTCACT TGCTATTACC AGCCAGCCCC CTATGTAGCA GATGCXAACT TTAGCAATTA 1020 

30 CTACATTGCA CAGGTTCAGC CAGTATTCAC GTGCCAGCAA CAGACCTACT CCACTTGGCT 1080 

ACCCTGCAAT TAAGAATCAT TTAAAAATGT CCTGTGGGGA AGCCATTTCA GACAAGACAG 1140 

GAGAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAGAGC 1179 

35 



(2) INFORMATION FOR SEQ ID NO: 262: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 

50 

GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGQG CTACAAGGAC 180 

55 CTGGACCTCA TCTTCCTGCG CCGACCTGCG CGGGGAAGGG GAGTTTCAGA CTGTGAAGGA 240 

CGTCGTGCTG GACTGCCTGT TGGACTTCTT ACCCGAGGGG GTGAACAAAG AGAAGATCAC 300 

ACCACTCACG CTCAAGGAAG CTTATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 360 

60 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



CCGATGGAGT 
GGATTCCCTC 
TCTTCTGCTC 
AATCGGGGAG 
CATTGCTACC 
CTTGGTGAGG 
TTCCAGGTTT 
TTTGCAGAAC 
TGGAGTGGTA 
CCTTATCACC 
TAATGTCACT 
CTACATTGCA 
ACCCTGCAAT 
GAGAGAAAAA 



CTTATATCCC TGTCAAACAA 
CGGAGGCAGT TTGAATTCAG 
TTTTATGAAT GTTCAGAGAA 
AGCGTCTATC GCGATTTCCA 
AGGAACCCAG AGGAAATCCG 
GGCTTTAGGC CCGCCTCTGA 
TTCATCGACT TCTCAGACAT 
CACTTTGTGG GATTGGAAGA 
AATGAGAGCA CAGTCTGCCT 
ATGCTGGCTA TCCGGGTGTT 
TGCTATTACC AGCCAGCCCC 
CAGGTTCAGC CAGTATTCAC 
TAAGAATCAT TTAAAAATGT 
NAANGAAAAG AG 



CAGTGGCAAA AATGTGGAAC 
TGTAGATTCT TTTCAAATCA 
CCCAATGACT GAGACATTTC 
GGAAGCCTTT GATCACCTTT 
AGGGGGAGGC CTGCTTAAGT 
TGAAATCAAG ACCCTTCAAA 
TGGAGAGCAG CAGAGAAAAC 
CCGCAAGTAT GAGTATCTCA 
GATGGGACAT GAAAGAAGAC 
AGCTGAOCAA AATGTrCATTC 
CTATGTAGCA GATGCCAACT 
GTGCCAGCAA CAGACCTACT 
CCTGTGGGGA AGCCATTTCA 



TGAAATTTGT 
AATTAGACTC 
ACCCCACAAT 
GTAACAAGAT 
ACTGCAACCT 
GGTATATGTG 
TGGAGTCCTA 
TCACX:CTTCA 
AGACTTTAAA 
CTAATGTGGC 
TTAGCAATTA 
CCACTTGGCT 
GACAAGACAG 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 
(B> TYPE: nucleic acid 
(Cj STRANDEDNESS : double 
(D) TOPOtOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

CGGGCTGGGT ATTTGCCTCG CACCATGGCG CCCAAGGGCA AAGTGGGCAC GAGAGGGAAG 

AAGCAGATAT TTGAAGAGAA CAGAGAGACT CTGAAGTTCT ACCTGCGGAT CATACTGGGG 

GCCAATGCCA TTTACTGCCT TGTGACGTTG GTCTTCTTTT ACTCATCTGC CTCATTTTGG 

GCCTGGTTGG CCTTGGGCTT TAGTCTGGCA GTGTATGGGG CCAGCTACCA CTCTATGAGC 

TCGATGGCAC GAGCAGCGTT COTCTGAGGA TGGGGCCCTG ATGGATGGTG GCACGAGCTC 

AACATGGAGC AGGGCATGGC AGAGCACCTT AAGGATGTGA TCCTACTGAC AGCCATCGTG 

CAGGTGCTCA GCTGCTTCTC TCTCTATGTC TGGTCCTTCT GGCTTCTGGC TCCAGGCCGG 

GCCCTTTACC TCCTGTGGGT GAATGTGCTG GGCCCCTGGT TCACTGCAGA CAGTGGCACC 

CCAGCACCAG AGCACAATGA GAAACGGCAG CGCCGACAGG AGCGGCGGCA GATGAAGCGG 

TTATAGCCAT TGACATTGTG GCCACAGGCC ACTGGCCCTG GGTGGCTCTG TCAGGCTGCA 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1162 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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10 



20 



50 



CAGCCCCTCA TGCCTGGAGC AATGAGGGTC TAGTCCAGGG GCCAAAAGCA GTCTGAQGTA 660 

TTGGCTATAC TTATACTCTA TAGGGTCGTT GAATAAATGG CTTAGAATGT GAAAAAAAAA 720 

AAAAAAAAAA ATTTT 735 

(2) INFORMATION FOR SEQ ID NO: 264: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AAGTGCATGA GCTGCCGATG TGGTGCTTAG TGATTGCGGT TTCGGTCGCT CTCCCGTGTT 60 

TCCCQGGCTG GGTATTTGCC TCGCACCATG GCGCCCAAGG GCAAAGTGGG CACGAGAGGG 120 

25 AAGAAGCAGA TATTTGAAGA GAACAGAGAG ACTCTGAAGT TCTACCTGCG GATCATACTG 180 
GGGGCCAATG CCATTTACTG CCTTGTGACG TTGGTCTTCT TTTACTCATC TGCCTCATTT > 240 

TGGGCCTGGT TGGCCTGGGC TTTAGTCTGG CAGTGTATGG GGCCAGCTAC CACTCTATGA 300 

30 

GCTCGATGGC ACGAGCAGCG TTCTCTGAGG ATGGGGCCCT GATGGATGGT GGCATGGACC 360 

TCAACATGGA GCAGGGCATG GCAGAGTGAG TGTCCCCCAC CGCCAGCCCA GGCACCTTAA 420 

35 GGATGTGATC CTACTGACAG CCATCGTGCA GGTGCTCAGC TGCTTCTCTC TCTATGTCTG 480 

GTCCTTCTGG CTTCTGGCTC, CAGGCCGGGC CCTTTACCTC CTGTGGGTGA ATGTGCTGGG 540 

CCCCTGGTTC ACTGCAGACA GTGGCACCCC AGCACCAGAG CACAATGAGA AACGGCAGCG 600 

40 

CCGACAGGAG CGGCGGCAGA TGAAGCGGTT ATAGCCATTG ACGATTTKGC SACNRGCCAC 660 

TGGCCCTGGG TGGCTCTGTC AGGGTGCACA GCCCCTCATG CCTGGAGCAA TGAGGGTCTA 720 

45 GTCCAGGGGC CAAAAGCAGT CTGAGGTATT GGGTATACTT ATACTCTATA GGGTCGTTGA 780 

ATA 783 



<2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LHTCTH: 1638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTICMsT: SEQ ID NO: 265: 
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GGCACGAGGC GGCGGCAGCG GTGGCGGCGG CGCCCCCXGG CXX3GAGCXX?r NCCXTTTCCC 60 

GTCGGGGAGC GCX3GGGYCGG GGYCCAGGGG ANCCXTGQGMC ACGGAGAGCG GGAAGAGGAT 120 

5 

GGATTGCCCG GCCCTCCCCC CXX3GATGGAA GAAGGAGGAA GTGATCCX3AA AATCTGGGCT 180 

AAGTGCTGGC AAGAGCGATG TCTACTACTT CAGTCCAACT GGTAAGAAGT TCAGAAGCAA 240 

10 GCCTCAGTTG GCAAGGTACC TQGGAAATAC TGTTGATCTC AGCAGTTTTC ACTTCAGAAC 300 

TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTGCGAA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA TACAACATTG CCAATTAGAC AAACAGCATC 420 

15 

AATTTTCAAA CAACCGGTAA CCAAAGTCAC AAATCATCCT AGTAATAAAG TGAAATCAGA 480 

CCCACAACGA ATGAATGAAC AGCCACGTCA GCTTTTCTGG GAGAAGAGGC TACAAGGACT 540 

20 TAGTGCATCA GATGTAACAG AACAAATTAT AAAAACCATG GAACTACCCA AAGGTCTTCA 600 

AGGAGTTGGT CCAGGTAGCA ATGATGAGAC CCTTTTATCT GCTGTTGCX:A GTGCTTTGCA 660 

CACAAGCTCT GCGCXAATCA CAGGGCAAGT CTCCGCTGCT GTGGAAAAGA ACCCTGCTGT 720 

25 

TTGGCTTAAC ACATCTCAAC CCCTCTGCAA AGCTTTTATT GTCACAGATG AAGACATCAG 780 

GAAACAGGAA GAGCGAGTAC AGCAAGTACG CAAGAAATTG GAAGAAGCAC TGATGGCAGA 840 

30 CATCTTGTCG CGAGCTGCTG ATACAGAAGA GATGGATATT GAAATGGACA GTGGAGATGA 900 

AGCCTAAGAA TATGATCAGG TAACTTTCGA CCGACTTTCC CCAAGAGAAA ATTCCTAGAA 960 

ATTGAACAAA AATGTTTCCA CIXX3CTTTTG CXTTOTAAGAA AAAAAATGTA CCCGAGCACA 1020 

35 

TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTrGTAATA 1140 

40 TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATQCCA GGTGCAATCT ACTGGAAATG 1200 

TAGCACTTAC GTAAAACATT TGTTTCCCCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 

45 

CTTTATTGAA AGGGGACACC TGTACATTCT TCCATCRTCA CTGTAAAGAC AAATAAATGA 1380 

TTATATTCAC AGACTGATTG GAATTCTTTC TGTTGAAAAG CACACACAAT AAAGAACCCC 1440 

50 TCGTTAGCCT TCCTCTGATT TACATTCAAC TCTGATCCCG GGGCCTTAGG TTTGACATGG 1500 

GAGGTGGGAG GAAGATAGCG CATATATTTG CAGTATGAAC TATTGCCTCT GGGACGTTGT 1560 

GAGGAATTGT GCTTTCACCA GAATTTCTAA GGATTTCTGG CTTAAATATC ACCTAGCXTTG 1620 

55 

TGGTAATTTT TTTTCCCT 1638 



60 



wo 98/39448 



PCT/US98/04493 



493 



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LiENGTH: 1455 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEnNESS : double 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

10 

CGTGCGTACT GCCATGCAGG TACCGGGTCC GGAATTCCCA GGGTCGACCC ACGCGTCCGC 60 

TCAGTTGGCA AGGTACCTGG GAAATACTC3T TGATCTCAGC AGTTTTGACT TCAGAACTGG 120 

15 AAAGATGATG CCTAGTAAAT TACAGAAGAA CAAACAGAGA CTGCGAAACG ATCCTCTCAA 180 

TCAAAATAAG GGTAAACCAG ACTTGAATAC AACATTGCCA ATTAGACAAA CAGCATCAAT 240 

TTTCAAACAA CCGGTAACCA AAGTCACAAA TCATCCTAGT AATAAAGTGA AATCAGACCC 300 

20 

ACAACGAATG AATGAACAGC CACGTCAGCT TTTCTQGGAG AAGAGGCTAC AAGGACTTAG 360 

TGCATCAGAT GTAACAGAAC AAATTATAAA AACCATGGAA CTACCCAAAG GTCTTCAAGG 420 

25 AGTTGGTCCA GGTAGCAATG ATGAGACCCT TTTATCTGCT GTTGCCAGTG CTTTGCACAC 480 

AAGCTCTGCG CCAATCACAG GGCAAGTCTC CGCTGCTGTG GAAAAGAACC ' CTGCTGTTTG 540 

GCTTAACACA TCTCAACCCC TCTGCAAAGC TTTTATTGTC ACAGATGAAG ACATCAGGAA 600 

30 

ACAGGAAGAG CGAGTACAGC AAGTACGCAA GAAATTGGAA GAAGCACTGA TGGCAGACAT 660 

CTTGTCGCGA GCTGCTGATA CAGAAGAGAT GGATATTGAA ATGGACAGTG GAGATGAAGC 720 

35 CTAAGAATAT GATCAGGTAA CTTTCGACCG ACTTTCCCCA AGAGAAAATT CCTAGAAATT 780 

GAACAAAAAT GTTTCCACTG GCTTTTGCCT. GTAAGAAAAA AAATGTACCC GAGCACATAG 840 

AGCTTTTTAA TAGCACTAAC CAATGCCTTT TTAGATGTAT TTTTGATGTA TATATCTATT 900 

40 

ATTCAAAAAA TCATGTTTAT TTTGAGTCCT AGGACTTAAA ATTAGTCTTT TGTAATATCA 960 

AGCAGGACCC TAAGATGAAG CTGAGCTTTT GATGCCAGGT GCAATCTACT GGAAATOTAG 1020 

45 CACTTACGTA AAACATTTGT TTCCCCCACA GTTTTAATAA GAACAGATCA GGAATTCTAA 1080 

ATAAATTTCC CAGTTAAAGA TTATTGTGAC TTCACTGTAT ATAAACATAT TTTTATACTT 1140 

TATTGAAAGG* GGACACCTGT ACATTCTTCC ATCRTCACTG TAAAGACAAA TAAATGATTA 1200 

50 

TATTCACAGA CTGATTGGAA TTCTTTCTGT TGAAAAGCAC ACACAATAAA GAACCCCTCG 1260 

TTAGCCTTCC TCTGATTTAC ATTCAACTCT GATCCCGGGG CCTTAGGTTT GACATGGGAG 1320 

55 GTGGGAGGAA GATAGCGCAT ATATTTGCAG TATGAACTAT TGCCTCTGGG ACGTTGrGAG 1380 

GAATTGTGCT TTCACCAGAA TTTCTAAGGA TTTCTGGCTT AAATATCACC TAGCCTGTGG 1440 

TAATTTTTTT TCCCT 1455 

60 
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(2) INPORMATICH* FX>R SEQ ID NO: 267: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) liEMGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY: linear* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

CGCCTGCAGT ACCGGTCCGG AATTCCCGGG TCGACCCACG CGTCGCTGAC CCAGGAGAAG 60 

15 

CTGCCTGTCT ACATCAGCCT GGGCTGCAGC GCGCTGCCGC CGCGGGGCCG GCAGCTGAAC 120 

TATGTGCTCT TCAGGGCGGG CACCGTGTTG CATTCATCTT TGTACCCCCA GCATCTAGCA 180 

20 GTGTTGGCAT GTAGTAGGCA CTCAAGAAAT GTGTGTTGAA TGAACGATGC CTGTGACAAG 240 

CAAGCGGACT TTATTCTTTC CTGACCCTTG CTCCTATGAC ACACCTCCTC CTQACTGCCA 300 

CTGTCACTCC TTCAGAGCAG AACTCCTCTA GGGAACCTGG ATGOGAAACA GCCATGGCCA 360 

25 

AGGACATCCT GGGTGAAGCA GGGCTACACT TTGATGAACT GAACAAGCTG AGGGTGTTGG 420 

ACCCAGAGGT TACCCAGCAG ACCATAGAGC TGAAGGAAGA GTGCAAAGAC TTTGTGGACA 480 

30 AAATTGGCCA GTTTCAGAAA ATAGTTGGTG GTTTAATTGA GCTTGTTGAT CAACTTGCAA 540 

AAGAAGCAGA AAATGAAAAG ATGAAGGCCA TCGGTGCTCG GAACTTGCTC AAATCTATAG 600 

CAAAGCAGAG AGAAGCTCAA CAGCAGCAAC TTCAAGCCCT AATAGCAGAA AAGAAAATGC 660 

35 

AGCTW3AAAG GTATCGQGTT. GAATATG7AG_Cm . ... J20_ 

AATTTATTGA CCAATTTATT TTTCAGAAAT GAACTGAAAA TTTCGCTTTT ATAGTAGGAA 780 

40 GGCAAAACAA AAAAAAGCCT CTCAAAACCA AAAAAACCTC TGTAGCATTC CAGCGGCTTG 840 

ACCAATGACC TATGTCACAA GAGGTGGCGT GTAAGGAATG CAGCCCCCTG AAGACAGCAC 900 

TACAAGTCTG GGQGAGCCAG TTTTAACATC AGTGCACAGC TGCTGCTGGT GGCCCTGCAG 960 

45 

TGTACGTTCT CACCTCTTAT GCTTAGTTGG AACTAAGCAG TTTGTAAACT TTCATCCTTT 1020 

TTTTTGTAAA TTCACAAAGC TTTGGAAGGA GARGCAATAA ATTTTTGKTT TCNAAATGGC 1080 

50 TTGATG 1086 



55 <2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 268: 

5 GGCACGGGAG CAGCCGGGCT GGTCCTGCTG CGAGCCGGCG GCCCGGAGTG GGGCGGCGGA 60 

GCAAACATGA ACGTTGGAGT TGCCCACAGT GAAGTGAATC CAAATACCCG TGTCATGAAC 120 

AGCCGGGGTA TGTGGCTGAC ATATGCATTG GGAGTTQGCT TGCTTCATAT TC?rCTTACTC 180 

10 

AGCATTCCCT TCTTCAGTGT TCCTGTTGCT TGGACTTTAA CAAATATTAT ACATAATCTG 240 

GGGATGTACG TATTTTTGCA TGCAGTGAAA GGAACACCTT TCGAAACTCC TGACCAGGGT 300 

15 AAAAGCAAGG CTCCTAACTC ATTGGGAACA ACTGGACTAT GGAGTACAGT TTACATCTTC 360 

ACGGAAGTTT TTCACAATTT CTCCAATAAT TCTATATTTT CTGQCAAGTT TCTATACGAA 420 

GTATGATCCA ACTCACITCA TCCTAAACAC AGCTTCTCTC CTGAGTGTAC TAATTCCCAA 480 

20 

AATGCCACAA CTACATGGTG TTCGGATCTT TGGAATTAAT AAC3TATTGAA ATGTTTTGAA 540 

ACTGAAAAAA AATTTTACAG CTACTGAATT TCTTATAAGG AAGGAGTGGT TAGTAAACTG 600 

25 CACTGTTTCT CTGATAATGT GAAATGAGAA GTATTTACAT TGGAGGGCCA ATGGCTGGTC 660 

CTTCAAGTGC TGTTTTGAAG TGCT^GATTTC CATTAAATGA TGCCTCTGTT TAATACACCT 720 

GGTACATTTC TGAAGAGGGG CTTTATAAGC AGGCTGQGCA GGCCCAGCTT ATAAGTTAAA 780 

30 

GGGCATCACA GTGAGGGTGT AGTAGATAAA TTCAAGGAAA TAAGAGATTT GTAAGAAACT 840 

AGGACCAGCT TAACTTATAA TGAATGGGCA TTGTGTTAAG AAAAGAACAT TTCCAGTCAT 900 

35 TCAGCTGTGG TTATTTAAAG CAGACTTACA TGTAAACCGG AATCCTCTCT ATACAAGTTT 960 

ATTAAAGATT ATTTTTATTA CCGTAAAAAA AAAAAAAAAA AAA 1003 

40 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1234 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ATCAGCATCT ACAAGTAGCA TATTTTGGAT GGTGTTTGTG TGCTACTTCA AAGTAACTAG 60 

GAAAAAATAA TCCTCGCAAC ACAGGTACCT TGTCATGTCA GAATTGGGGG TGTTAGGTTG 120 

55 

CCAGTTGTAT CAGTGTTGAT TCATTTCATT ACTTCCTACA GAGCAAACAT GAACGTTGGA 180 

GTTGCCCACA GTGAAGTGAA TCCAAATACC CGTGTCATGA ACAGCCGQGG TATGTGGCTG 240 

60 ACATATGCAT TGGGAGTTGG CTTGCTTCAT ATTGTCTTAC TCAGCATTCC CTTCTTCAGT 300 
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GTTCCTGTTG CTTGGACTTT AACAAATATT ATACATAATC TGGGGATCTA CGTATTTTTG 360 

CATGCAGTGA AAGGAACACC TTTCGAAACT CCTGACCAGG GTAAAGCAAG GCTCCTAACT 420 

CATTQGGAAC AACTGGACTA TQGAGTACAG TTTACATCTT CACGGAAGTT TTTCACAATT 480 

TCTCCAATAA TTCTATATTT TCTGGCAAGT TTCTATACGA AGTATGATCC AACTCACTTC 540 

ATCCTAAACA CAGCTTCTCT CCTGAGTGTA CTAATTCCCA AAATGCCACA ACTACATGGT 600 

GTTCGGATCT TTGGAATTAA TAAGTATTGA AATGTTTTGA AACTGAAAAA AAATTTTACA 660 

GCTACTGAAT TTCTTATAAG GAAGGAGTGG TTAGTAAACT GCACTC5TTTC TSTGATAATG 720 

TGAAATGAGA AGTATTTACA TTGGAGGGCC AATGGCTGGT CCTTCAAGTG CICTTTTGAA 780 

GTGCAGATTT CCATTAAATG ATGCCTCTGT TTAATACACC TGGTACATTT CTGAAGAGGG 840 

GCTTTATAAG CARGCTGGGC AGGCCCAGCT TATAAGTTAA AGGGCATCAC AGTGAGGGTG 900 

TAGTAGATAA ATTCAAGGAA ATAAGAGATT TGTAAGAAAC TAGGACCAGC TTAACTTATA 960 

ATGAATGGGC ATTGTGTTAA GAAAAGAACA TTTCCAGTCA TTCAGCTGTG GTTATTTAAA 1020 

GCAGACTTAC ATGTAAACCG GAATCCTCTC TATACAAGTT TATTAAAGAT TATTTTTATT 1080 

acx:rtacata TTTCKCTTGT TTTATGTAAG YGGATGTATA TCCTCTTGTT TTATACAAGC 1140 

CAGTTCCCAC TTATGAGGGT ACTTTTTTGG TTTTGCTGGG CTTAATATTG TGTATTGGTC 1200 

AATGAGGCCA TTTTTACANT TATTAACGTT ACAG 1234 
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(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

NGAGGTGCGT TCTGAGCCGT CTGTCCTGCG CCAAGATGCT TCAAAGTATT ATTAAAAACA 60 

TATGGATCCC CATGAAGCCC TACTACACCA AAGTTTACCA QQAGATTTGG ATAGGAATGG 120 

GGCTGATGGG CTTCATCGTT TATAAAATCC GGGCTGCTGA TAAAAGAAGT AAGGCTTTGA 180 

AAGCTTCAGC GCCTGCTCCT GGTCATCACT AACCAGATTT ACTTGGAGTA CATGTGAAAG 240 

AAAACGTCAG TCTGCCTGTA AATTTCAGCA AGCCGTGTTA GATGGGGAGC GTGGAACGTC 300 

ACTGTACACT TGTATAAGTA CCGTTTACTT CATGGCATGA ATAAATGGAT CTGTGAGATG 360 

CACTGCTACC TGGTACTGCT TTCAGTGTGT TCCCCCTCAG CCCTCCGGCG TGTCAGGCAT 420 



